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SUMMARY 


This  is  the  final  report  of  Grant  AFOSR-87-0308  with  the  Air  Force  Office  of 
Scientific  Research,  which  is  also  the  continuation  of  Contract  FO8635-84-C-0273  with  the 
U.S.  Air  Force  Armament  Laboratory  at  Eglin  Air  Force  Base.  The  work  was  performed 
at  the  School  of  Electrical  Engineering  at  the  Georgia  Institute  of  Technology.  The  work 
proposed  under  this  Grant  is  continuing  under  a  new  Grant  to  Northwestern  University 
jointly  with  Georgia  Tech. 

The  research  covered  several  aspects  of  the  basic  issues  that  are  needed  to  develop 
and  implement  nonlinear  and  hybrid  systems  schemes  for  the  filtering,  tracking,  and  control 
of  maneuvering  vehicles  in  an  uncertain  and  nonlinear  geometry.  It  is  based  on  the 
approximation  of  the  original  nonlinear  problem  by  a  switched  Markov  linear  models  which 
in  turn  lead  to  hybrid  model  formulation  or  to  piecewise  linear  approximations.  Four 
aspects  are  considered:  1.  Approaches  to  handling  hybrid  systems  models;  2.  Fast  and 
slow  decomposition  for  piecewise  linear  systems;  3.  Estimation  in  the  presence  of 
impulsive  inputs  that  can  serve  as  either  models  for  the  switching  behavior  or  the  changes 
in  maneuvers;  4.  Modeling,  parameterization,  and  realization  issues  for  hybrid  systems. 
Applications  to  nonlinear  filtering  and  tracking  schemes  and  their  implementation  is  also 
addressed. 

The  research  was  performed  under  the  direction  of  A.  H.  Haddad  (now  with 
Northwestern  University)  and  E.  I.  Verriest.  Dr.  James  M.  Crowley  of  AFOSR  and  Dr. 
James  Cloutier  of  AFATL  monitored  the  Grant’s  progress. 

Five  Fh.  D.  students  were  supported  under  the  Grant:  B.  S.  Heck,  J.  Ezzine,  M.  A. 
Ingram,  P.  West,  and  S.  Gray.  The  first  three  have  completed  their  dissertation  in  August 
1988,  May  1989,  and  August  1989,  respectively. 
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SECTION  I 


INTRODUCTION 

The  objective  of  this  research  was  to  develop  nonlinear  filtering  and  tracking 
algorithms  for  systems  subject  to  complex  geometries  and  uncertainties.  These  attributes 
characterize  the  air-to-air  engagement  scenario.  The  approach  was  based  on  the 
approximation  of  the  original  nonlinear  stochastic  model  with  a  piecewise  linear  model. 
Then  the  resulting  model  was  further  approximated  by  a  switched  Markov  linear  model. 
The  result  is  a  dynamic  system  of  the  form 

X(t)  =  A[r(t)]  X(t)  +  B[(r(t)j  U(t)  (la) 

Y(t)  =  C[r(t)l  X(t)  +  V(t)  (lb) 

where  the  spte  vector  is  X(t),  the  observation  vector  is  Y(t),  U(t)  can  serve  as  the  control 
vector  when  the  control  problem  is  considered,  or  can  serve  as  the  process  noise  model  for 
the  filtering  problem,  V(t)  is  the  observation  noise  vector.  The  noise  processes  are  assumed 
to  be  white  and  Gaussian.  The  process  r(t)  is  called  either  as  the  form  index,  the  switching 
process,  or  the  macro-state  process,  and  is  assumed  to  be  a  finite  state  Markov  process 

taking  the  values  in  {1,2 . N}.  The  approximation  is  via  what  is  known  as  either  switched 

Markov  models  or  hybrid  systems.  1116  linear  system  in  such  a  case  switches  among  the 
forms  (A[i],B[i],C[i])  according  to  the  value  of  r(t),  i.e.,  when  the  macro-state  is  equal  to 
i. 

In  earlier  reports  the  validity  of  the  approximation  has  been  analyzed  as  discussed 
in  reference  1,  and  its  applications  to  nonlinear  filtering  have  been  investigated  as  provided 
in  References  2  and  3.  This  report  addresses  several  aspects  of  the  resulting  approximate 
model  and  general  approaches  to  its  estimation,  realization,  and  control.  The  main  report 
is  subdivided  into  four  major  sections.  Section  II  addresses  the  general  properties  of  hybrid 
systems  from  the  point  of  view  of  control  and  stabilization.  Section  III  addresses  fast  and 
slow  decomposition  of  the  original  piecewise  linear  approximation  with  the  view  of 
simplifying  the  resulting  algorithms.  Section  IV  addresses  an  alternative  model  for  the 
jumps  representing  the  maneuvers  and  develops  approximate  nonlinear  filtering  algorithms 
for  these  models.  Section  V  discusses  sev>  u  issues  resulting  from  the  realization  of  such 
systems  as  they  affect  sensitivity,  robustness  ’  -  _  identification.  The  body  of  each  section 
will  be  relatively  short,  as  the  results  are  pro*. Jed  in  appropriate  appendices. 
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SECTION  II 


HYBRID  SYSTEM  MODELS 

Since  the  approximation  to  the  original  nonlinear  model  is  represented  by  the  hybrid 
model  (1),  a  major  part  of  the  study  dealt  with  control  and  stabilization  properties  of  hybrid 
systems.  General  properties  of  controllability  and  observability  of  such  models  are  given 
in  Reference  4  and  provided  also  in  Appendix  A.  These  properties  carry  over  from  the 
linear  time-invariant  case  and  stress  the  simplification  of  the  algorithms  used  for 
controllability,  observability,  and  stability.  Usually,  these  system  models  switch  among 
several  realization.  An  important  issue  to  consider  is  the  ability  to  represent  such  models 
by  an  average  model.  Such  an  average  model  may  be  suitable  under  certain  condition,  or 
under  cases  where  the  switches  may  be  fast.  The  use  of  such  averaging  methods  can 
simplify  the  resulting  control  and  filtering  algorithm.  Several  averaging  procedures  for  the 
stabilization  of  hybrid  systems  are  reported  in  Reference  5  and  provided  in  Appendix  B. 
Two  properties  of  the  average  system  are  investigated  in  References  6  and  7,  and  are  given 
in  Appendix  C  and  D.  The  first  considers  the  error  that  results  from  averaging  and  how 
to  determine  the  validity  of  the  use  of  the  average  model  The  second  considers  the 
minimality  properties  of  the  average  systems  that  would  allow  the  stabilization  of  the 
original  system  by  using  the  average  model.  The  main  advantage  for  using  average  models 
is  that  there  is  no  need  to  identify  the  macro-state  and  the  resulting  algorithms  are  rather 
simple.  Of  course,  the  average  model  can  replace  the  original  system  only  under  restricted 
conditions.  The  iast  aspect  of  hybrid  systems  considered  in  this  problem  is  concerned  with 
eigenvalue  assignment  for  hybrid  system  models,  which  in  this  case  deals  with  the  Lyapunov 
exponents.  The  result  is  given  in  Reference  8  and  Appendix  E.  The  largest  Lyapunov 
exponent  determine  the  stability  of  such  systems,  and  its  assignment  using  control  gains 
determines  the  ability  to  stabilize  such  systems. 

Other  aspects  of  hybrid  systems  dealing  with  realization  and  its  relationship  to 
implementation  and  filtering  is  provided  in  Section  V. 


3 


SECTION  m 


FAST  AND  SLOW  DECOMPOSITION 

The  approximation  used  to  model  the  nonlinear  systems  exhibited  fast  and  slow 
behavior  both  in  the  switching  process  and  in  each  individual  realization.  Such  fast  ad  slow 
behavior  can  lead  to  simplification  of  the  resulting  algorithms  due  to  two-time  scales 
decomposition  and  to  reduced  order  of  the  filters-controllers  via  aggregation.  The  theory 
of  singular  perturbation  which  has  been  developed  to  deal  with  such  behavior  has  been 
restricted  to  smooth  systems.  In  our  case  the  switches  and  the  piecewise  linear  models  lead 
to  difficulties  that  require  an  extension  to  the  standard  linear  theory.  This  section  treats 
the  singular  perturbation  theory  for  non-smooth  systems  with  fast  and  slow  modes.  In 
particular  it  extends  the  theory  developed  in  Reference  9  for  quantized  systems  to  general 
piecewise  linear  models.  The  piecewise  linear  models  is  considered  in  Reference  10  and 
provided  also  in  Appendix  F.  Usually  sliding  modes  occur  in  such  models  in  both  the  fast 
and  slow  dynamics.  Reference  11  (also  given  in  Appendix  G)  discusses  the  conditions  for 
the  existence  of  the  sliding  modes  and  how  the  algorithm  can  handle  the  resulting 
complications.  Two  additional  extensions  of  the  theory  are  given  in  References  12  and  13, 
which  are  also  provided  in  Appendix  H  and  I  respectively.  The  first  extends  the  theory  to 
the  case  of  stochastic  input  as  most  of  our  models  are  subject  to  random  inputs.  The 
second  extends  the  quantized  system  to  the  vector  quantization  case.  The  quantization 
problem  is  of  interest  in  this  case  due  to  the  fact  that  with  the  high  order  of  the  filter  used 
in  the  original  filtering  problem,  it  is  appropriate  to  use  only  a  few  quantization  levels  to 
reduce  the  computational  complexity  of  the  problem.  In  earlier  reports  the  quantization 
aspect  was  covered  by  an  approximate  stochastic  differential  equation  model  with  state 
dependent  noise. 
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SECTION  IV 


FILTERS  FOR  POISSON  DRIVEN  MODELS 

This  section  considers  an  alternative  approach  to  the  modeling  of  the  switching 
jumps  that  affects  the  systems.  In  particular  it  considers  a  self-excited  Poisson  model  as  an 
input  to  the  system.  These  self-excited  inputs  may  represent  varying  maneuvers  and  or 
control  actions  that  affects  the  target.  It  is  well-known  that  the  linear  filter  for  such  models 
is  not  optimal.  It  is  difficult  to  derive  such  a  linear  filter  for  the  case  where  the  average 
of  the  input  jumps  is  not  zero.  The  study  first  considered  several  alternatives  as  suboptimal 
nonlinear  detection-estimation  schemes  to  solve  the  problem.  These  are  summarized  in 
References  14  and  15  and  provided  in  Appendix  J  and  K.  The  properties  of  the  model  and 
-the  derivation  of  the  appropriate  linear  filters  for  such  models  are  considered  in  References 
16  and  17  and  provided  in  Appendix  L  and  M.  Simulation  results  and  the  derivation  of  the 
error  properties  of  the  resulting  approximate  filters  are  still  being  investigated. 
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SECTION  V 


REALIZATION,  ROBUSTNESS,  AND  SENSITIVITY 

This  section  addresses  several  aspects  of  hybrid  systems  modeling  with  particular 
emphasis  to  realization  and  robustness  as  they  affect  the  accuracy  and  sensitivity  of  the 
implementation  used  for  the  filter. 

Work  on  optimal  realizations  of  such  systems  progressed  in  two  directions:  earlier 
results  showing  the  optimality  of  the  balanced  realizations  (see  Reference  18),  in  the 
discrete  time  case  have  been  extended  to  the  continuous  time  case  and  is  given  in 
Reference  19  and  Appendix  N.  The  results  have  also  been  extended  to  multi-mode  systems 
(see  Reference  20  and  Appendix  O),  and  general  time-varying  systems  as  given  in 
Reference  21  and  Appendix  P.  In  these  references  applications  to  filtering  have  been 
analyzed,  and  Reference  20  also  addressed  the  optimal  implementations  of  the  suboptimal 
nonlinear  filters  for  the  switched  Markov  models.  Optimality  conditions  for  non¬ 
infinitesimal  perturbations  have  been  given  as  well.  Conditions  for  optimality  over  finite 
sets  have  been  applied  to  the  parameterization  of  3-D  rotations  in  Reference  22  and 
Appendix  Q. 

Realization  problems  for  hybrid  systems  (reachability  and  observability)  for 
generalized  systems  have  been  solved.  More  specifically,  results  for  N-periodic  systems  have 
been  reported  in  References  23  and  24  and  are  provided  in  Appendix  R  and  S. 

In  addition  to  the  realization  point  of  view  the  sensitivity  of  analog  algorithms  were 
investigated  from  a  parameter  sensitivity  point  of  view.  In  particular,  a  discussion  of  optical 
analog  computing  devices,  for  matrix  computations  was  presented  in  Reference  25,  and  a 
wider  collection  of  devices  were  analyzed  in  Reference  26. 


SECTION  VI 


CONCLUSION 

The  research  addressed  several  basic  aspects  of  filtering,  and  control  for  nonlinear 
and  hybrid  models.  These  models  may  be  used  to  approximate  the  nonlinear  environment 
and  other  uncertainties  in  air-to-air  engagement.  Research  is  continuing  on  the  integration 
of  these  approaches  and  in  the  implementation  algorithms  that  could  lead  to  a  filter  tat  is 
applicable  to  a  realistic  system. 
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ABSTRACT 


This  paper  considers  a  special  class  of 
hybrid  systems,  whose  state  space  is  a  cross- 
product  space  of  an  Euclidean  space  and  a  finite- 
state  space.  Such  models  may  be  used  to  rep¬ 
resent  systems  subject  to  known  abrupt  parameter 
variations,  such  as  commutated  networks.  They 
may  also  be  used  to  approximate  some  types  of 
time-varying  systems.  The  paper  investigates 
controllability,  observability,  and  stability  of 
hybrid  systems.  In  particular,  it  derives  a 
necessary  and  sufficient  algebraic  condition,  a 
simple  algebraic  criterion,  and  a  computationally 
simple  algebraic  sufficient  test  for  controll¬ 
ability  and  observability.  Moreover,  it  provides 
a  simple  sufficient  stability  condition. 

1 .  Introduction  and  Problem  Formulation 


This  paper  examines  the  controllability, 
observability  and  related  issues  of  a  special 
class  of  hybrid  systems  [1,  2).  The  state  space 
of  a  hybrid  system  is  a  cross-product  space  of  an 
euclidien  space  and  a  finite-state  space. 
Basically,  hybrid  systems  are  linear  piece-wise 
constant  time-varying  systems,  which  are  swit¬ 
ching  among  a  finite  number  of  constant  reali¬ 
zations.  Systems  of  this  type  can  be  used  to 
model  synchronously  switched  linear  systems  [?), 
networks  with  periodically  varying  switches  [4], 
and  systems  subject  to  failures  [1].  Even  though 
hybrid  systems  are  time-varying  they  lend 
themselves  to  a  precise  and  complete  qualitative 
and  quantitative  analysis.  Among  such  results  we 
mention  the  possibility  to  explicitly  compute 
their  transition  matrices,  to  derive  and  state 
necessary  and  sufficient  conditions  for  their 
stability,  and  the  possibility  to  derive  an 
algebraic  controllability/observability  tests 
similar  to  the  usual  one  for  linear  time-in¬ 
variant  systems.  This  is  possible  due  to  the 
■any  features  hybrid  systems  share  with  time- 
invariant  systems.  Moreover,  because  they  are 
****"v*rylnE>  they  offer  many  useful  features  due 
to  their  variable  structure  property.  In  other 
words,  hybrid  systems  are  a  mixture  of  time- 
nvarlant  systems  with  which  they  share  the 
*  **  r*ic  •n<i  geometric  structures,  and  time- 
^  systems  with  which  they  share  their 
■tr'»«ure  property  that  will  be  useful 
“  lh*lr  control  and  stabilization. 

""  systems  considered  in  this  paper 

are  assumed  to  have  the  fora 


x(t)  -  A(r(t))x(t)  +  B(r( t) )u( t )  (1.1) 

y(t)  -  C(r( t ) )x( t )  (1.2) 

where  x  is  the  system  state  vector  of  dimension 
n,  u  is  the  control  input  vector  of  dimension  p, 
y  is  the  output  vector  of  dimension  m,  and  r(t) 
is  the  "form  index"  which  is  a  deterministic 
scalar  sequence  taking  values  in  the  finite  index 
set  >M1,  2 . N). 

This  type  of  model  can  be  used  to  represei t 
systems  subject  to  known  abrupt  parameter 
variations  such  as  commutated  networks  or  to 
approximate  some  types  of  time- varying  systems. 
This  is  done  by  imposing  a  "deterministic" 
switching  rule  on  the  time  behavior  of  the  form 
index.  However,  in  order  to  model  unknown  abrupt 
phen^zena  such  as  component  and  interconnection 
failures  the  form  index  can  be  modeled,  for 
example,  as  a  finite-state  Markov  chain. 

The  latter  problem  has  received  considerable 
attention  within  the  control  community,  but  many 
important  generalizations  remain  to  be  worked 
out.  Chizeck  et  al  [1]  denote  such  a  control 
problem  the  Jump  Linear  Quadratic  (JLQ)  problem 
since  they  view  it  as  an  extention  of  the 
standard  Linear  Quadratic  (LQ)  problem.  However, 
very  little  attention  was  given  to  the  ’eter- 
ministic  version  of  the  problem,  even  thou  h  it 
shares  many  features  with  the  JLQ  problem,  "his 
paper  is  concerned  with  the  deterministic  ve-  ion 
of  the  problem. 


Let  denotes  any  sequence  of 
the  values  taken  by  r(t),  and  let  fit^ 
time  interval  during  which  r(t)  -  i. 
the  paper  the  following  assumption  is 
SN  contains  all  the  values  that  r(t) 
this  case  we  define 


length  M  .  f 
denotes  t  .? 
Througho* 
made,  tha 
takes.  Ir. 


N 

T  5  I  4tj  (2) 

i-1 

as  the  period  of  the  system.  If  in  addition  the 
sequence  in  every  Sjj  is  the  same  the  system  is 
called  a  periodic  hybrid  system.  It  will  be 
obvious  that  the  assumption  that  MSN  in  SM  will 
not  affect  the  results.  Hence,  the  assumption 
that  M  •  N  will  be  siade  to  simplify  the  nota¬ 
tions. 


im...  .  research 
W**)i-**-C-0273  (with 


is  supported  by  the  U.S.  Air  Force  under  contract 
the  Anamept  Laboratory)  and  grant  AFQSR-87-0308. 


41 


The  system  taka*  the  realization 
E1«(A1,Bi,C1)  whan  r(t)  ■  i,  with  id.  Thia 
raalization  ia  called  tha  ith  fora. 

Tha  following  la  an  outlina  of  tha  papar. 
Saction  2  dlscuaaaa  tha  atablllty  of  hybrid 
aystaaa  where  a  staple  sufficient  atability 
criterion  la  derived.  The  observability  and 
controllability  of  periodic  hybrid  systems  ara 
treated  ir  Sections  3  and  4,  respectively. 
Algebraic  -coservabillty  and  controllability  tests 
are  obtained.  Section  S  extends  the  results  of 
Sections  3  and  4  to  general  hybrid  systems.  In 
section  6  the  stabllizability  of  hybrid  systems 
is  addressed  and  a  simple  application  is  used  for 
illustration  purposes.  Section  7  concludes  the 
paper. 

2.  Stability 


important  to  note  that  tha  above  theorem  is 
stated  not  only  for  periodic  hybrid  systems  but 
it  applies  to  tha  more  general  hybrid  systems  as 
defined  above  too. 

3.  Observability 

Since  hybrid  systems  are  a  special  class  of 
time-varying  systems  they  display  interesting 
properties  relative  to  controllability  and 
observability.  It  would  be  appropriate  to  define 
the  latter  properties  while  keeping  in  mind  the 
fact  that  these  systems  are  variable  structure 
systems.  We  start  with  tha  observability 
criterion  since  it  is  simpler  to  prove.  Conse¬ 
quently,  the  dual  co  itrollability  criterion  is 
stated  by  appealing  to  the  duality  principle. 

Definition 


Even  though  hybrid  systems  are  time-varying 
systems  it  is  possible  to  obtain  necessary  and 
sufficient  asymptotic  stability  conditions.  We 
start  by  studying  the  stability  of  periodic 
hybrid  systems.  To  this  end  we  recall  a  theorem 
by  Willems  [S]  that  provides  a  necessary  and 
sufficient  conditions  under  which  piece-wise 
constant  periodic  systems  are  uniformly  asympto¬ 
tically  stable.  Basically,  the  theorem  states 
that  for  the  system  to  be  asymptotically  stable 
its  transition  matrix  over  one  period  of  time  has 
to  be  a  contraction.  This  theorem  can  be 
obviously  modified  to  derive  a  similar  one  for 
hybrid  systems  which  are  not  necessarily  period¬ 
ic.  However  the  resulting  theorem  will  be 
difficult  to  use,  since  one  has  to  ccmpute 
(N-l)NI  products  of  exponential  matrices  and 
check  their  eigenvalues. 

In  order  to  derive  simpler  conditions  to  test 
for  the  stability  of  such  systems,  a  different 
norm  is  defined,  namely  the  logarithmic  norm  [6, 
7].  The  result  is  a  simpler  condition  that  is 
sufficient  only. 


A  periodic  hybrid  system  is  said  to  be 
observable  if  there  exists  some  finite  tf  i  t0+T 
such  that  the  initial  state  x(t0)  of  the  unforced 
system  can  be  determined  from  the  knowledge  of 
y(t)  on  [t0,tf]. 

Using  the  above  definition  it  is  possible  to 
state  an  algebraic  necessary  and  sufficient 
observability  criterion  very  similar  to  the  usual 
algebraic  test.  Moreover  this  algebraic  test  is 
expressed  as  a  function  of  the  observability 
matrices  of  the  different  forms.  This  condition 
is  a  generalization  of  the  well  known  algebraic 
observability  test. 

Theorem  2 

A  periodic  N-form  hybrid  system  is  observable 
if  and  only  if  the  observability 

matrix 


°1 

02Exp(A1(6t1)) 


Definition 


(6) 


The  logarithmic  norm  of  a  matrix  A  associated 
with  the  matrix  norm  l.l  is  defined  by 


OfjExp(Afj.j(dtfj.j))  . . . Exp( A^(dtj ) ) 


u(A)  -  lim  (II  +  hAI  -  l)/h  (3) 

h-*Q+ 


has  full  rank,  where  Of  is  the  observability 
matrix  of  the  1th  form,  lcN. 


The  norm  satisfies  the  following  inequality 


Proof 


IExp(At) I  4  Exp(u(A)t).  (4) 

This  norm  is  now  used  to  derive  the  stability 
condition. 

Theorem  1 


For  the  null  solution  of  the  hybrid  system 
(1)  to  be  uniformly  asymptotically  stable,  it  is 
sufficient  to  have 

E  uU^Pi  <  0,  pt  *  dti/T  -  (ti-t^^/T,  icM.  (5) 
1 

The  proof  is  a  simple  application  of  the 
logarithmic  norm  to  Willems'  theorem.  It  is 


Let  us  assume  that  the  system  is  in  its  ith 
form  at  time  tcttj.t^+i]  then  the  output  is  given 
by  the  following  expression 


1 

y(t)  *  ciExp(A1(t-ti))  n  Exp(Aj (atj ) )x( t0 ) . 


(7) 


We  now  take  n-1  derivatives  of  y(t)  in  (7)  and 
arrange  them  in  a  column  vector  Yt(t)  *  [y  y^ 
y(‘)  . ..y'0'1')'  which  may  be  expressed  as 


1 

Yt(t)  -  OiExpUiU-ti))  H  Exp( Aj (atj ) )x( t„ )  (8) 
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where  0,  Is  the  observability  matrix  of  the  ith 
form.  If  the  same  procedure  is  repeated  for  all 
icN  and  combined  together  the  following  equation 
results 


M 

■ 

OiExp(Ai(t-t0 ) ) 

*2 

02Exp(A2(t-t1))Exp(A1<5t1) 

yn 

0NExp(AN(t-tN.1)). . •Exp(A16t1) 

From  this  point  on  the  proof  is  identical  to  a 
standard  textbook  [B;  pp.  354]. 

4.  Controllability 

At  this  point  the  dual  algebraic  controll¬ 
ability  test  is  introduced.  First  a  dual 
definition  for  controllability  is  proposed  and 
used  along  with  the  algebraic  observability  test 
to  prove  the  result  via  the  duality  principle. 

Definition 

A  hybrid  system  is  said  to  be  state-controll¬ 
able  if  for  any  t0  each  state  x(t0)  can  be 
transferred  to  any  final  state  Xf  after  one 
period.  Thus  there  exists  a  tj,  t,,+T  S  tf  <  • 
such  that  x(tf)  •  Xf. 

Before  presenting  the  algebraic  controll¬ 
ability  criterion,  the  dual  to  the  observability 
criterion  given  above,  the  usual  controllability 
test  for  time-varying  systems  is  used.  This  is 
done  in  order  to  display  certain  interesting 
properties  of  hybrid  systems.  If  we  compute  the 
controllability  grammian  and  use  the  fact  that 
the  system  is  piece-wise  constant  we  obtain  the 
following  theorem. 

Theorem  3 

A  periodic  hybrid  system  of  N  forms  is 
controllable  if  and  only  if 

N  tt 

W(t,,t0+T)  «  £  /  ♦i(t.t0)BiBi*i(t,t0)dt  (10) 

i-1  tw 

has  full  rank. 

Corollary 

A  periodic  hybrid  system  is  completely 
controllable  if  and  only  if  it  is  controllable. 

Proof 

See  Remark  (2.18)  in  [9],  then  use  the 
Theorem  3. 

Befor  proceeding  any  further,  a  necessary  and 
sufficient  condition  for  a  periodic  hybrid  system 
to  be  uniformly  completely  controllable  is 
stated.  This  result  will  be  of  importance  when 
stabilisability  of  such  systems  is  in  question. 


Theorem  4 

A  periodic  hybrid  system  is  uniform!-.- 
completely  controllable  if  and  only  if  it  is 
completely  controllable. 

Proof 

If  the  periodic  system  is  completely  con¬ 
trollable,  there  oust  exist  a  finite  time  s2  T 
such  that  W(0,s)  &  cl  >  0.  Therefore  the  result 
is  proved  by  using  Lenma  1  from  Silverman  et  al 
[10]  and  Remark  (2.18)  in  [9]. 

Having  used  the  usual  test  we  are  ready  to 
present  an  algebraic  controllability  test  similar 
to  the  one  used  in  linear  time-invariant  systems. 
The  following  criterion  applies  for  periodic 
hybrid  systems.  A  similar  criterion  for  general 
hybrid  systems  will  be  introduced  in  a  later 
section. 

Theorem  5 

A  periodic  hybrid  system  of  N  forms  is 
controllable  if  and  only  if  the  controllability 
matrix 

t Cjg ,  Exp(Ajj(6tfj) )Cjj. j ,  ..., 

Exp( An( atN_ !))... Exp(A2( it2 ) )Cj ]  (11) 

has  full  rank,  where  is  the  usual  controll¬ 
ability  matrix  of  the  1th  form,  IcN. 

Proof 

Using  the  principle  of  duality  and  the 
algebraic  observability  theorem  presented  above 
proves  the  theorem. 

For  computational  purposes,  it  is  better  to 
rewrite  the  above  controllability  matrix  as 
follows 

(C|j,  Exp(Afl(6tN) ) ,  ...{C4, 

Exp(A3(St3))(C2,  Kxp(A2(dt2))Cj) ] .  (12) 

This  way  one  does  not  have  to  compute  all  of  the 
matrices  needed  to  express  (11)  and  compute  its 
rank.  That  is  the  rank  is  checked  sequentially 
and  (12)  is  augmented  appropriately  until  full 
rank  is  achieved.  If  full  rank  can  not  be 
achieved  throughout  this  sequential  test  then  the 
system  is  not  controllable.  The  same  observation 
applies  to  the  observability  criterion. 

In  addition  to  the  above  algebraic  criteria 
for  controllability  and  observability,  two  more 
tests  are  introduced.  The  first  test  is  a  simple 
and  geometrically  and  computationally  attractive 
necessary  algebraic  test.  The  second  one  is  a 
simple  algebraic  sufficient  condition. 

Theorem  6 

A  necessary  algebraic  condition  for  a  hybrid 
system  to  be  controllable  is 

rank[C2,  C2,  ....  Cfj]  *  rank  C  ■  n.  (13) 


43 


££'  >  0,  i.e.  it 


Where  Is  the  controllability  matrix  for  the 

ith  form,  icH. 

Proof 

We  write  the  state  of  the  system  at  time  s, 
for  x(t,)  -  0: 


x(s)  -  /  *(s,x)B(x)dx.  (14) 

t  s 

We  now  use  the  fact  that  the  system  is  piece-wise 
constant  and  the  linearity  property  of  the 
integral  operator  to  obtain  for  s  ■  tjj 

x(tN)  ■  Exp(A^dtg). . .Exp(A26t2) 


/  Exp(A1(t1-r))B1u(x)dx  +. . . 
f  « 

tty„  1 

+  Exp(Afjdtfj)  I  Exp(Afj.1(tN.1-x))BN.1u(x)dT 
*N-2 

tty 

+  /  Exp(AN(tfj-x))BNu(x)dx.  (IS) 

CN-1 

After  expanding  the  exponential  matrices  inside 
every  integral,  it  is  found  that  x(t(])  is  an 
element  of  the  column  range  space  of  the  control¬ 
lability  matrix  £  given  in  Theorem  5.  Moreover, 
it  is  easy  to  see  that 

rank  £  i  rank  C  i  n  (16) 

an  inequality  that  dictates  that  full  rankness  of 
C  is  a  necessary  condition  for  our  system  to  be 
controllable. 

The  above  proof  gives  an  alternate  way  to 
prove  the  necessity  part  in  Theorem  5.  It  is 
also  interesting  to  note  that  this  latter  test  is 
independent  of  the  £^'s  order.  This  order 
independence  would  have  been  very  beneficial, 
however  it  does  not  hold  in  the  sufficiency  part 
of  the  proof. 

Now  we  state  a  theorem  that  gives  a  simple 
sufficient  algebraic  test.  With  the  above  simple 
necessary  test  this  condition  will  provide  an 
efficient  algebraic  method  to  test  for  the 
controllability/observability  of  hybrid  systems. 
This  theorem  is  adapted  from  a  theorem  given  in 

[HI. 

Theorem  7 

A  sufficient  condition  for  a  periodic  hybrid 
system  to  be  controllable  is 

ranktBj!),  Exp(A^(6t[j) )Bjj_ j ,  ..., 

Exp(AN(6t(j-i)).  •  .Exp(A2(6t2))Bi] 

*  rank  C  ■  n. 


Proof 

Since  £  has  full  rank  then 
is  positive  definite.  Also 

C(*l  ,»2* • • • »sn)^' (*1 ,*2‘ • • ' ,sn)  " 

N 

£  *(sk,t0)BkB£*,(sk,t0)  (18) 

k-1 

where  skc(tk,tk. j ] .  Then  we  have 

tty 

W(t„,tN)  -  /  $(s,t0)B(s)B'(s)$(s,ta)d3 
f  « 

N  sk±o 

11(1/  ♦(s, ta )B(s)B' (s)*(s,t„ )ds) 

k"l  sk 

-  o£(sl,S2>. •• ,sN,t0 )£'(*! ,S2,- • ..SN,t0) 

+  o(o),  (19) 

for  o  sufficiently  small.  If  we  assume  that  £ 
has  full  rank  then  for  o  small  enough  (19)  is 
positive  definite.  But  then  (19)  implies  that 
W(tTJ,t6)  >  0  which  proves  the  theorem. 

5.  Aperiodic  Hybrid  Systems 

In  this  section  we  generalize  the  above 
results  stated  for  period 'c  hybrid  systems  to 
more  general  aperiodic  hybrid  systems.  Neverthe¬ 
less,  many  of  these  results  apply  to  general 
hybrid  systems  without  modification.  Therefore 
we  will  state  only  the  most  important  results. 

Theorem  8 

A  hybrid  system  is  controllable  if  and  only 
if  Theorem  5  holds  for  all  possible  N!  permuta¬ 
tions  of  the  form- index  set  N. 

Theorem  9 

A  hybrid  system  is  controllable  if  Theorem  7 
holds  for  all  possible  Nl  permutations  of  the 
Index-set  N. 

It  is  obvious  that  Theorem  6  applies  for 
general  hybrid  systems  too.  Moreover,  Theorem  6 
may  also  be  sufficient  under  very  general 
conditions.  A  heuristic  argument  can  be  given  as 
follows:  Since  any  matrix  exponential  is  a 

perturbation  of  the  identity  matrix  it  follows 
that  multiplying  any  matrix  with  matrix  exponen¬ 
tials  will  not  change  its  range  space  dramatical¬ 
ly.  That  is  if,  for  example,  Cj  and  C2  have 
algebraic  complementary  range  spaces  (i.e, 
range(C})  is  perpendicular  to  range(C2))  then 
range(Exp(AT)Cj)  will  almost  always  remain  an 
algebraic  complement  but  not  necessarily  perpen¬ 
dicular  to  range(C2)<  As  a  matter  of  fact, 
Mariton  [2]  states  that  he  has  proved  that 
Theorem  6  is  also  a  sufficient  condition. 


(17) 


6.  Stabilizabllitv 

This  section  presents  some  results  concerning 
the  control  and  stabilization  of  hybrid  systems. 
These  results  use  standard  techniques  to  control 
and  stabilize  hybrid  systems. 

Ikeda  et  al  [12]  looked  at  the  relation 
between  controllability  properties  of  the  system 
and  various  degrees  of  stability  of  the  closed 
loop  system  resulting  from  linear  state  variable 
feedback.  Their  results  are  as  follows:  For  any 
initial  time  t0,  and  any  continuous  and  aonotoni- 
cally  nondecreasing  function  6 C . , 1 0 )  such  that 
6(t0,t0)“0,  the  transition  matrix  &( . , . )  of  the 
closed  loop  system  can  be  made  to  satisfy 

I4’(t,t0)l  S  a(t0 )Exp{-S( t , t0 )}  for  all  tSt0, 

if  and  only  if  the  system  is  completely  controll¬ 
able.  Furthermore,  in  case  of  a  bounded  system, 
for  any  m  S  0,  a  bounded  feedback  matrix  can  be 
found  such  that  the  transition  matrix  of  the 
closed  loop  system  is  made  to  satisfy 

l<t>( 1 2 , t x ) »  £  aExpf-mCtj-tj)}  for  all  tj,  tj*^, 

if  and  only  if  the  system  is  uniformly  completely 
controllable.  Thus,  their  results  can  be 
regarded,  in  some  sense,  as  extensions  of  the 
well  known  results  of  closed  loop  pole  assignment 
for  time- invariant  systems. 

Therefore  there  is  a  high  degree  of  flexibi¬ 
lity  in  the  stabilization  of  hybrid  systems  if 
they  are  either  completely  controllable  or 
uniformly  completely  controllable. 

As  an  illustration  of  the  above  result,  a 
procedure  is  proposed  to  stabilize  a  periodic 
hybrid  system  via  state  feedback  when  all  of  the 
forms  are  minimal.  This  design  procedure  allows 
the  designer  to  impose  or  choose  an  upper  bound 
on  the  norm  of  the  transition  matrix  of  the 
hybrid  system  to  be  stabilized.  Thus  the  norm  of 
the  transition  matrix  for  hybrid  systems  plays  a 
role  similar  to  the  maximum  overshoot  and  time 
constants  in  linear  time- invariant  systems. 

In  order  to  impose  an  upper  bound  on  the  norm 
of  the  transition  matrix  a  known  stability 
criterion  [5]  is  used:  The  null  solution  of  (1) 
is  uniformly  asymptotically  stable  if  and  only  if 
there  exists  two  positive  constant  and  c2 
such  that 

l*(t,t0)l  i  c1Exp(-c2(t-t0))  (20) 

or  all  t  2  0.  Therefore  the  use  of  Theorem  1 
leads  to  the  following  design  criterion 

E  M(A1)ati  S  k*  -  k2T  (21) 
i 

where  kj  ■  ln(cj)  and  T  is  the  period  of  the 
hybrid  system.  The  kt's,  i«l ,  2,  are  the  design 
parameters  that  are  chosen  according  to  the 
specifications  on  the  upper  bound  of  the  transi¬ 
tion  matrix  of  the  closed  loop  system  and 
consequently  reflect  the  desired  time  response  of 
the  system.  This  is  possible  whenever  (21)  is 


achievable.  Consequently,  (21)  can  be  always 
obtained  via  state  feedback  since  every  form  is 
observable.  It  is  important  to  note  that  this 
design  procedure  applies  to  both  periodic  and 
aperiodic  hybrid  systems.  It  should  be  noted 
that  the  minimality  condition  for  every  form  is 
not  necessary  to  achieve  such  a  design. 

7.  Conclusion 

This  paper  considered  a  special  class  of 
linear  piece-wise  constant  time-varying  systems. 
These  systems  are  called  hybrid  systems  because 
the  set  of  linear  time-invariant  systems  among 
which  the  systems  are  switching  is  finite.  Their 
state  space  thus  contains  both  continuous  and 
discrete  components. 

Since  hybrid  systems  share  several  features 
with  linear  time- invariant  systems  it  was 
possible  to  derive  the  following  results:  A 

necessary  and  sufficient  stability  condition  and 
a  simple  sufficient  criterion.  Algebraic 
necessary  and  sufficient  controllability- 
observability  tests  similar  to  the  usual  time- 
invariant  tests.  An  interesting  necessary 
controllability-observability  condition  which  may 
also  be  sufficient,  along  with  a  simple  suffi¬ 
cient  condition. 

The  necessary  controllability/observability 
condition  is  a  flat  block  matrix  composed  from 
the  controllability/observability  matrices  of 
every  form  which  makes  it  independent  of  the 
switching  order.  This  order  independence  along 
with  the  fact  that  the  condition  is  "almost" 
sufficient  make  it  a  very  useful  test.  Therefore 
identifying  the  class  of  hybrid  systems  for  which 
this  condition  is  necessary  and  sufficient  would 
be  an  interesting  problem. 

Additional  work  is  needed  concerning  stabil¬ 
ity  theory  of  this  class  of  systems.  The 
variable  structure  property  seems  to  be  a 
promising  feature  in  this  direction.  In  addition 
if  one  thinks  of  every  system  Ej*(A^,B^,Ci)  with 
ieN  as  an  operator  acting  on  the  state  x  during 
dt£,  and  these  operators  are  applied  in  a 
successive  manner,  then  this  process  can  be 
viewed  as  an  iterative  process  [13].  Viewing  a 
hybrid  system  as  an  iterative  process  sheds  some 
light  on  some  complicated  issues  such  as  the 
stability  of  such  systems. 

Finally  adapting  the  results  of  this  paper  to 
hybrid  systems  where  the  switching  is  a  stochas¬ 
tic  process  such  as  a  Markov  chain  may  be  useful. 
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ABSTRACT 

This  paper  discusses  some  practical  methods  of 
analysis  end  control  of  two-fora  hybrid  systems. 
These  systems  are  called  hybrid  because  their  state 
space  contains  both  continuous  and  discrete 
components.  These  are  systems  that  switch  among  a 
finite  number  of  linear  time -invariant  realiza¬ 
tions.  Such  models  may  be  used  to  represent 
systems  subject  to  known  abrupt  parameter  varia¬ 
tions  such  as  cnemntated  networks  or  to  approximate 
borne  types  of  time-varying  systems.  This  paper 
restricts  the  analysis  to  systems  that  switch  among 
only  two  possible  linear  models. 

1.  Introduction  and  Problem  Formulation 

This  paper  discusses  some  practical  methods  of 
analysis  and  control  of  two-form  hybrid  systems. 
Hybrid  systems  denote  a  special  class  of  piece-wise 
constant  time-varying  systems.  The  set  of  constant 
realizations  among  which  tha  model  is  switching  is 
finite  and  in  this  paper  is  restricted  to  two. 
Such  systems  can  be  used  to  model  synchronously 
switched  linear  systems  [ 1 ] ,  networks  with  period¬ 
ically  varying  switchas  [2],  and  systems  subject  to 
failures  [3].  In  particular  we  examine  the 
stabilization  and  related  issues  of  two-form  hybrid 
systems  via  a  special  averaging  technique. 
Averaging  thaory  may  be  used  in  either  determinis¬ 
tic  or  probabilistic  contexts.  In  the  probabilis¬ 
tic  case  averaging  is  introduced  in  a  natural  way 
by  Caking  expected  values.  In  the  deterministic 
case,  however,  averaging  is  introduced  via  perturb¬ 
ation  techniques.  Averaging  methods  have  received 
considerable  attention.  Brockett  and  Wood  [2]  used 
a  deterministic  averaging  technique  to  analyse  and 
atablllze  a  class  of  bilinear  systems  which  are 
difficult  to  analyse  or  control  otherwise.  Geman 
[A]  used  probabilistic  averaging  techniques  to 
study  the  stability  of  random  differential  equa¬ 
tions.  His  main  interest  was  to  explore  the 
relation  between  asymptotic  stability  in  the 
average  equation,  and  asymptotic  stability  in  the 
random  equation;  Specifically,  when  does  tha  first 
imply  the  second?  Cosut  et  al  {5]  applied  the 
theory  of  averaging  to  the  analysis  of  tha  stabil¬ 
ity  of  adaptive  systems. 

Evan  though  hybrid  systems  are  time-varying 
they  lend  themselves  to  a  precise  and  complata 
qualitative  and  quantitative  analysis.  Among  such 
results  we  mention  the  possibility  to  explicitly 
compute  their  transition  matrices,  to  derive  and 
state  necessary  and  sufficient  conditions  for  their 
stability  (6],  and  most  interestingly  the  poss¬ 
ibility  to  derive  algebraic  controllability  and 
observability  tests  similar  to  tha  usual  ones  found 
in  the  theory  of  linear  time- invariant  systems  [6]. 
This  is  possible  due  to  the  many  features  hybrid 
systems  share  with  time- Invariant  systems. 
Moreover,  because  they  are  time-varying,  they  offer 


many  useful  features  due  to  their  variable 
structure  property. 

The  hybrid  systems  under  consideration  are 
ossuamd  to  have  tha  fora 

i(t)  •  A(r(t))x(t)  ♦  B(r(t) )u(t)  (1.1) 

y(t)  -  C(r(t))x(t)  (1.2) 

where  x  is  the  system  state  vector  of  dimension  n, 
u  is  the  control  input  vector  of  dimension  p,  y  is 
the  output  vector  of  dimension  a,  and  r(t)  is  the 
"fora  index"  which  is  a  deterministic  scalar 


sequence  taking  values 
N*{  1 ,  2 . N). 

in 

th« 

finite  index  set 

Such  modal  may  be 

tued 

to 

represent  systems 

subject  to  known  abrupt  parameter  variations  such 
as  coaiButated  networks  or  to  approximate  some  types 
of  time- varying  systems  [7].  The  latter  can  be 
done  by  imposing  a  "deterministic"  switching  rule 
on  tha  time  behavior  of  the  form  index.  However,  to 
modal  unknown  abrupt  phenomena  such  as  component 
and  interconnection  failures  the  form  index  can  be 
modeled,  for  example,  as  a  finite-state  Markov 
chain  (FSMC)  [3]. 

The  latter  problem  has  received  considerable 
attention  within  the  control  community  but  much 
work  still  remains  to  be  done.  Chizeck  et  al  [3] 
denotes  the  optimal  control  problem  of  such  systems 
the  Jump  Linear  Quadratic  (JLQ)  problem  since  they 
view  it  as  an  extention  of  the  standard  Linear 
Quadratic  (LQ)  problem.  However,  very  little 
attention  was  given  to  the  stabilization  and 
control  of  the  deterministic  version  of  the 
problem,  even  though  it  shares  many  features  with 
the  JLQ  problem.  This  paper  is  concerned  with  the 
latter  problem. 

Lat  S)4  denotes  any  sequence  of  length  M  of  the 
values  taken  by  r(t),  and  let  dtj  denotes  the  time 
interval  during  which  r(t)  ■  1.  Throughout  the 

paper  tha  following  assumption  is  made,  chat 
contains  all  the  values  that  r(t)  takes.  In  this 
ease  we  define 

N 

T  •  £  dtt  (2) 

i-1 

as  tha  period  of  tha  system.  If  in  addition  the 
sequence  in  every  Sg  is  the  same  the  system  is 
called  a  periodic  hybrid  systam.  It  will  be 
obvious  that  making  the  assisption  that  HAN  on 
will  not  affect  the  results.  The  as  sumption  that 
H  “  N  simplifies  the  notations.  Let  the  1th  form 
denote  the  realization  associated 

with  tha  ith  form  index  (i.e.,  r(t)  ■  1),  with  lcN. 
In  this  paper  N  ■  2,  so  that  we  are  concerned  with 
Flip-Flop  (F2)  systems  as  a  special  class  of  hybrid 
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systems.  The  F2-systems  '  switch  "hack  and  forth" 
batwaan  two  time- invariant  systaas  (two  forms),  2i, 
and  of  ldantlcal  d loans ions.  Tha  systaa  spends 
time-ur  at  2i(  1-1,2.  Tha  only  condition 
imposed  on  tha  switching  is  that  tha  systaa  can  not 
spand  aora  than  2T^  at  2^.  That  is,  no  ordar  is 
imposed  on  tha  switching  batwaan  tha  two  foras. 


ylald  Exp(C)  whara  C  is  as  in  (4).  Now  it  is 
daslrad  to  hava  an  axprasslon  for  C  that  is 
lndapandant  of  ordar,  to  agraa  with  tha  order 
independence  Introduced  in  the  definition  of  F2- 
systaas,  than  wa  oust  compute 

lni(Exp(A)Exp(B)+Exp(B)Exp(A) } 


Tha  following  is  an  outlina  of  tha  papar. 
Section  2  starts  by  introducing  tha  avaraging 
technique  based  on  a  Lie  algebraic  formulation.  It 
also  adrasses  tha  perturbations  induced  by  tha 
avaraging  procedure,  and  offers,  whara  possible,  an 
alternative  to  avaraging.  Finally  it  discusses  tha 
controllability  of  tha  average  systaa.  Tha 
stability  and  stabilisation  issues  of  hybrid 
systaas  are  treated  in  section  3.  Section  4 
concludes  tha  papar  and  points  to  additional  open 
problems . 

2.  Tha  Averaging  Technique 

In  this  section  wa  introduce  a  practical 
avaraging  technique  which  will  be  halpful  in  tha 
analysis  and  control  of  two-fora  hybrid  systeas . 
It  will  be  obvious  from  tha  following  trsataent 
that  tha  proposed  avaraging  method  applies  to 
multi-fora  hybrid  svsteas  as  wall.  Tha  aain  tools 
in  simplifying  tha  analysis  and  synthesis  of 
stabilizing  controls  for  such  systems  will  be  soaa 
basic  ideas  froa  linear  systaas  theory  coabinad 
with  tools  froa  Lie  algebras,  linear  algebra,  and 
stability  theory  of  ordinary  differential  aqua¬ 
tions. 

Tha  averaging  methodology  to  be  used  in  this 
paper  13  based  on  a  result  from  Lie  algebras  known 
as  the  Baker-Campbell-Kausdorff  Formula  12) :  Given 
two  real  matrices  A  and  B  there  is  no  guarantee 
that  there  exists  a  real  matrix  C  such  that 

Exp(A)Exp(B)  -  Exp(C) .  (3) 

This  will  be  the  case,  however,  if  IAI+IBI  i  ln(2) 
[8],  and  then  C  will  be  given  by  a  convergent 
infinite  expression 

C  -  A  +  B  +  (1/12)([A,B] , B]  + 

( 1/12) [ [8, A] ,A]  +  ...  (4) 

where  the  symbol  [A,B]  s  AB-BA  is  tha  commutator 
product.  This  expression  is  the  Baker -Campbe 11- 
Hausdorff  formula  (BCH). 

Similar  expressions  lika  tha  BCH  formula  are 
used  in  a  large  number  of  useful  approximations  in 
physics  [9]  and  switched  electrical  networks  [2]. 
In  this  section  wa  show  how  tha  BCH  formula  and 
related  expressions  can  be  used  to  analyse  and 
stabilise  F2-systaas  and  hybrid  systaas  in  general. 

Tha  concept  is  similar  to  tha  one  used  in  [2] 
to  stabilize  bilinear  systaas.  In  our  casa  wa  are 
interested  in  tha  stabilization  of  F2-systeas  whosa 
A-aatrix  satisfies 


A(t)  - 


fAj  for  0  i  t  <Tj 
(A2  for  T^  S  t  <T, 


(5) 


•  A+B+(1/12)[[A,B],B]+(1/12)([B,A],A] 

-  A+B+(1/12)((A,B),B-A].  (6) 

Therefore,  wa  obtain  a  series  of  approximations 
for  F2-systeas.  If  tha  F2-systaa  realization  is  2^ 
-  (Ai,b})  for  Tj,  and  £3  -  (A2.b2)  for  T2.  then  the 
first  approximation  is 

2  -  {aAj+( l-a)A2,  abj+( l-a)b2) ,  (7) 

with 

a  1  Tj/(Tj  +  T2), 
and  tha  second  approximation  is 

2  -  {[aAj+(l-a)A2+( l/12)a( l-a)[ [Aj ,A2] , 

(l-a)A2-aA1),  ab1+(l-a)b2).  (8) 

Some  comments  about  these  two  approximate 
expressions  are  in  ordar.  The  first  order  app¬ 
roximation  can  be  interpreted  at  least  in  two 
different  ways.  Tha  first  interpretation  is  a 
probabilistic  one;  it  says  that  tha  average  system 
can  be  vlawad  as  tha  probabilistic  average  of  tha 
hybrid  system  with  Pd-E^)  -  a  and  P(2-£2)  -  1-a. 

This  is  consistent  with  tha  frequency  interpreta¬ 
tion  idea  especially  when  we  are  interested  in  long 
tlme-rang-  behavior  of  the  system.  The  second 
interpretation  comes  froa  the  theory  of  variable 
structure  systems  (VSS)  and  the  Filippov's  con¬ 
tinuation  technique.  The  latter  technique  was 
introduced  to  study  the  behavior  of  the  system  in 
chattering  mode.  The  above  first  order  approxima¬ 
tion  is  nothing  but  a  Filippov's  average  system. 
Therefore,  a  hybrid  system  can  be  viewed  as  a  VSS 
system  in  chattering  mode  where  the  switching 
manifolds  > re  solution  orbits  of  the  average 
system. 

It  was  shown  in  [2],  via  an  example,  that  in 
soma  special  cases  tha  second  correction  term  is 
more  important  than  tha  first,  which,  in  fact, 
might  vanish.  Thus,  tha  usefulness  of  the  second 
approximation.  However,  in  [2]  there  was  no 
attaapt  to  analyse  tha  errors  introduced  by  the  BCH 
formula  and  tha  averaging  method.  Obviously,  there 
are  two  very  important  issues  in  using  such  a 
formula  and  averages  derived  froa  it.  The  first 
one  is  tha  error  introduced  by  only  using  few  terms 
in  tha  BCH  expression  while  computing  tha  average 
matrix.  Tha  second  one  is  tha  difference  between 
tha  actual  systaa,  in  our  casa  tha  F2- system,  or  in 
[2]  tha  bilinear  systaa,  and  tha  average  system 
used  to  reflect  tha  average  behavior  of  tha  system 
under  consideration.  Both  of  these  issues  have  to 
be  addressed  because  of  their  paramount  importance, 
especially  tha  difference  between  the  actual  systaa 
and  its  average  which  is  a  function  of  tha  error 
introduced  by  truncating  tha  BCH  formula  expres¬ 
sion.  Since  tha  latter  problems  require  a  lengthy 
discussion  only  a  summary  of  tha  results  is  given 
in  this  papar. 


and  it  is  desired  to  approximate  tha  expression  In  what  follows  we  present  soma  results  related 

Exp(A^T^)Kxp(A2(T-Ti).  Tha  BCH  formula  is  used  to  to  tha  accuracy  of  tha  usage  of  a  truncated  BCH 
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formula.  Using  the  BCH  formula,  one  can  obtain  an 
approximation  C  of  C,  to  any  desired  order. 
Consequently,  C  can  be  written  as  C  *  C+C,  where  C 
is  the  unknown  error  due  to  the  approximation. 
Therefore,  the  induced  error  in  computing  Exp(C)  by 
using  the  approximate  matrix  C  is 

E  «  Exp(CT)-Exp(Cl) .  (9) 

The  first  order  approximation  of  E  can  be  expressed 
In  terms  of  the  solution  of  a  linear  time- invariant 
matrix  differential  equation: 

Proposition  1  f 10 1 

Let  Ex  denote  the  first  order  approximation  in 
t  to  E,  then  Ex  satisfies  the  following  matrix 
differential  equation 

Y(t)  •  CY  +  cCpExp(Ct) ,  Y(0)  -  0.  (10) 

where  C  s  eCp,  end  c  is  a  positive  scalar. 

In  order  to  compute  upper  bounds  for  Ex  and  E 
the  following  results  may  be  used. 

Proposition  2 

Assume  that  IExp(Ct)l  S  M(t)Exp(J5( t)t) ,  with 
B(t)  a  scalar  function,  then 

■Ei I  S  ( lCl/2l£l )M(t)« 

Exp{0(t)t}(Exp{2lClt)-l).  (11) 

The  use  of  results  in  [10],  and  the  assumption  that 

M(t)  is  monotonlc  yields 

IEI  S  tl£lM2(t)Exp{(B(t)  +  M(t)lCl)t).  (12) 

Sometimes  it  is  possible  to  avoid  the  computa¬ 
tion  of  A-"average".  That  is,  under  certain 
conditions,  it  is  possible,  via  state  feedback,  to 
make  the  P2-system  time- Invariant  in  A.  That  is 
given  Ix  end  E2,  end  the  appropriate  conditions 
satisfied  by  the  given  forms,  one  can  compute  a 
feedback  gain  matrix 

X  -  [Ki,K2]  (13) 

which  will  make  the  A-matrices  of  both  forms  equal 
and  cosequently  render  the  A-matrlx  of  the  hybrid 
system,  upon  closing  the  loop  via  Ki  for  Ex, 
constant.  Moreover,  this  constant  A  matrix  is 
given  by  the  following  expression 

A  -  Ax  -  BXK1  -  A2  -  B2X2.  (U) 

In  this  section  necessary  and  sufficient 
conditions  ere  derived  for  the  existence  of  K  end  e 
compact  computation  recipe  based  on  the  Eronecker- 
product  end  the  generalised- inverse  techniques  is 
given. 

Theorem  1 

Given  the  P2- system 

i(t)  -  AiX  ♦  B^.  i  •  1,2.  (15) 

much  that 

*■»!•[ Ai-Aj.Bi.-Bj]  •  tenge [Bi.-Bjl,  (16) 

then,  there  exists  e  minimise- no  re  G  ■  [Ei  ,E?  ]  such 
that 

A1  *  *1*1  *  *2  "  *2*2-  (17) 


Moreover,  the  G  matrix  is  given  by 

•  (G)  -  <[Bi,-B2)+  •  I^sUi-Aj),  (18) 

where  •(.)  is  the  stacking  operator. 

Proof 

Starting  with  the  forms  £i(  i-1,  2,  the  ith 

model  is  given  by 

x(t)  -  Axx  +  Biu.  (19) 

If  we  define  A  s  Ai-6Ai  the  above  system  can  be 
written  as  follows 

x(t)  -  Ax  +  4Axx  +  Bxu,  (20) 

and  the  next  step  is  to  compute  a  gain  matrix 
such  that 

4Axx  ♦  Bxu  “  4AjX  -  BiKxx  ■ 

(4At  -  Bi^Jx  -  0  for  all  x.  (21) 

This  is  equivalent  to 

Biiq-  4Ai(  (22) 

which  is  nothing  but  an  algebraic  equation  for  the 
unknown  Ex.  Therefore,  for  Ex  to  exist  one  needs 
the  well  known  condition 

renk[Bi,4Ai]  -  rankfBi),  1—1,  2  (23) 

which  is  equivalent  to  the  existence  of  some  matrix 
Ex  such  that 

4Ax  -  BiKi,  1-1,  2.  (24) 

Substructing  the  first  equation  from  the  second 
yields 

Ax  -  A2  -  BxKi  -  B2K2  (25) 

which  can  be  written  as  follows 

(Ax  -  A2]  -  tB1,-B2][E{,KJ] '  (26) 

which  is  Itself  an  algebraic  equation  with  the  Ex, 
1*1,2,  as  unknown  and  the  condition  given  in  the 
theorem  is  the  one  needed  for  the  existence  of  both 
gains.  Equation  (18)  is  nothing  but  a  compact  way 
to  write  such  equations.  As  a  matter  of  fact  it  is 
very  useful  when  nusierlcal  techniques  are  used  to 
solve  the  problem. 

Corollary 

If  Range [Ai-A2,Bi'B2]  -  RangeCBi'B^ ,  then 

1  ■  KX  -  E2,  (27) 

end  the  gain  matrix  la  given  by: 

•(E)  -  {[BrB2]t  •  In)s(A1-A2).  (28) 

Proof 

When  Ex  «  X2  -  X  is  needed  the  proof  of  the 
above  theorem  is  changed  accordingly  to  yield  the 
results  stated  in  the  theorem. 

We  mow  return  to  the  average  system.  One  of 
the  key'asswptlons  made  to  design  the  regulator 
via  averaging  Is  the  controllability  of  the  average 
•ystam.  This  assumption  is  not  unreasonable  since 
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(33) 


the  controllability  proparty  of  linear  time- 
invariant  aystaaa  is  generic.  However,  one  can 
construct  hybrid  systems  such  that  their  .averages 
are  not  controllable  (11].  In  [11]  a  sufficient 
condition  that  identifies  a  class  of  hybrid  system s 
for  which  the  hybrid  system's  controllability 
guarantees  the  controllability  of  the  average 
system  was  given.  The  result  is  stated  in  the 
following  theorem. 

Theorem  2 

The  average  system  of  a  hybrid  system  is 
controllable  if 

a.  Rank  [Cp  C2,  ....  Cu]  -  n. 

b.  All  forms  are  simultaneously  diagonaliz- 
able. 

3.  Stability 

Even  though  Hybrid  systems  are  time-varying 
systems  it  is  possible  to  obtain  necessary  and 
sufficient  asymptotic  stability  conditions  [6]. 
However,  the  latter  condition  is  computationally 
time  consuming  and  a  simple  sufficient  asymptotic 
stability  condition  was  presented  to  alleviate  the 
computational  burden.  In  this  section  the  same 
sufficient  condition  is  rederlved  using  other 
means,  which  may  be  genera li ted. 

In  order  to  rederive  this  sufficient  condition 
a  brief  introduction  to  the  notion  of  logarithmic 
norm  is  given.  The  logarithmic  norm  (also  known  as 
the  logarithmic  derivative,  the  measure  of  a 
matrix)  was  introduced  in  1938  separately  by 
Dahlquist  [12]  and  Lozinskij  [13]  as  a  tool  to 
study  the  growth  of  solutions  to  ordinary  differen¬ 
tial  equations  and  the  erroi.  growth  in  discretiza¬ 
tion  methods  for  their  approximate  solution.  It  is 
formally  defined  as  follows: 

Definition 

The  logarithmic  norm  associated  with  the  matrix 
norm  l.l  is  defined  by 

u(A)  •  lim  (II  +  hAI  -  1 )/h.  (29) 

h-*0+ 

The  explicit  expression  for  the  logarithmic  norm 
associated  with  the  Euclidian  norm  is 

u(A)  -  max(u  :  u  c  A( (A+A*)/2) } .  (30) 

Then  the  following  inequality  is  true: 

IExp(At) I  S  Exp(u(A)t ) .  (31) 

Now  we  are  ready  to  apply  the  logarithmic  norm 
to  derive  a  simple  sufficient  condition  to  test  for 
the  stability  of  hybrid  systems. 

Theorem  f 1 A 1 

Let  t  *  A(t)  be  a  regulated  function  from 
[0,  •)  to  C"*®.  Then  the  solution  of 

i(t)  -  A(t)x(t)  (32) 

satisfies  the  inequalities 

t 

lx(t,)l  ExpW  u[ -A( t ' )  ]dt ' } 

t* 


t 

i  I x( t ) I  Slx(t,)l  Exp  /  u[A( t' ) ]dt' . 

t. 

Basically  the  theorem  states  that  the  rate  of 
change  of  the  norm  of  the  state  vector  x(t)  is 
bounded  by  u(A(t))  and  to  insure  stability  this 
bound  must  be  negative. 

Theorem  3 

For  the  null  solution  of  the  hybrid  system  (1) 
to  be  uniformly  asymptotically  stable,  it  is 
sufficient  to  have 

£  ^(AjJpi  <0,  Pi  »  (t1-ti.1)/T,  icN.  (34) 
i 

Before  giving  the  proof  of  the  theorem  we 
introduce  a  different  way  to  represent  hybrid 
systesu  (l)-(2).  This  new  formulation  has  the 
advantage  of  simplifying  certain  proofs.  This  new 
representation  is  as  follows 

N  N 

x(t)  -  {  £  Vi(t)Ai)x(t)  +  {  Z  Vi(t)Bi)u(t)  (35) 
1-1  i-1 

N 

y(t)  -  {  Z  Vi(t)Ci>x(t)  (36) 

1-1 

where  v,(t)  -  1  when  the  system  is  governed  by  the 

1th  realization  Ip  and  vt(t)  -  0  otherwise.  The 
Vi(t)  function  is  called  the  1th  indicator  func¬ 
tion.  It  is  evident  from  the  definition  of  hybrid 
systeass  that  at  any  point  in  time  only  one  of  the  N 
indicator  functions  is  one.  Now  we  prove  the 
theorem. 

Proof 

Using  the  above  representation  the  homogeneous 
part  of  a  hybrid  system  can  be  written  as 

N 

x(t)  -  {  Z  v1(t)Ai)x(t)  (37) 

i-1 

Using  Theorem  27  in  [14]  one  can  write 

t 

lx(t)l  i  lx(t0)lExp{  /  u(A(s))ds) 

^0 

t  N 

-  Ix(t,)l  Exp{  /  ul  Z  vi(s)Ai]ds  )  (38) 

t0  1-1 

Using  Theorem  5(e,  d)  in  [14]  yields 
N  t 

lx(t)l  S  lx(t,)|  Exp{  Z  I  vi(s)u[Ai]ds] 
i-l  t, 

N  t 

-  Ix(t,)l  Exp{  Z  u[A1]  /  v1(s)ds) 

i-1  t, 

N  t 

-  Ix(t,)l  Exp{(  £  utAjKl/t-t,)  S  v1(t)ds)(t-t0)) 

i-1  t, 

N 

-  Ix(t,)l  Exp{(  £  piu[A1])(t-t0)}  (39) 

i-1 

with 
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t 

pA  «  (l/(t-t0))  /  vi(*)dj.  (40) 

This  completes  the  proof  sfter  taking  the  limits. 

This  simple  sufficient  condition  states  that 
for  a  hybrid  system  to  be  uniformly  asymototically 
stable  the  weighted  average  of  the  logarithmic 
norms  of  each  realization  has  to  be  negative. 
Therefore,  this  sufficient  condition  allows  for 
unstable  forms.  That  is,  as  long  as  the  stable 
forms  dominate,  the  overall  system  is  asymptotical¬ 
ly  stable.  This  domination  can  occur  in  two  ways: 
either  the  stable  forms  are  strongly  stable  (l.e., 
highly  negative  logarithmic  norms),  or  the  time 
span  of  the  stable  forms  is  large  relative  to  the 
time  span  of  the  unstable  ones  or  a  combination  of 
the  latter  two  reasons. 

The  above  Interpretation  provides  a  mathemati¬ 
cal  rationale  to  the  observations  made  by  Chizeck 
et  al  [3]  while  analyzing  such  systems. 

4.  Stabilization 

This  section  presents  some  results  concerning 
the  control  and  stabilisation  of  hybrid  systems. 
These  results  use  standard  techniques  to  con¬ 
trol/stabilize  hybrid  systems. 

Definition 

A  hybrid  system  is  stabilizable  if  there  exists 
a  constant  feedback  gain  matrix  (  such  that  the 
closed  loop  hybrid  system  is  asymptotically  stable. 

Theorem  4 

A  hybrid  system  is  stabilizable  if 

a.  The  average  system  is  stabilizable, 

b.  The  following  inequality  is  satisfied 

N 

I  u[«Ai  -  dBiKJpi  <  0,  (41) 

i-1 

where  K  is  a  stabilizing  gain  matrix  of  the  average 
system  and  6E^s(6A^,  6B^)  is  the  difference  between 
the  1th  realization  and  the  average  system. 

Proof 

Given  a  hybrid  system  with  a  stabilizable 
everage  then  there  exists  at  least  one  constant 
gain  matrix  K  such  that  the  average  closed  loop 
matrix  (A  "K)  is  Hurwitz.  Therefore,  Saverags^^ 
is  asymptotically  stable.  But  the  actual  system 
response  is  composed  of  two  components,  the  average 
system  component  end  the  error  component.  That  is 

x(t)  -  k.vrBgmU)  ♦  e(t)  (42) 

where  the  error  dynamics  ere 

e(t)  •  t  T1(t)(4A1-«BiK]e.  (43) 


Condition  b  Is  e  sufficient  requirement  for  e(t)  to 
be  asymptotically  stable  which  proves  the  theorem. 

The  following  example  la  <;ivvn  to  illustrate 
the  results. 


Example 

Given  the  following  F2-system  E(a) 

ij(t)  -  -ax1  +  x2  +  u,  (44) 

i2(t)  -  (1  -  a)x2  +  au.  (45) 

where  a  *  Tj/T,  E^  "  E(l)  end  E2  ■  E(0).  The  above 
system  is  the  exact  average  system.  The  transfer- 
function  of  the  average  system  is  given  by 

H(s)  -  (s  +  (2a  -l))/(s  +  a)(s  -1  +  a).  (46) 

For  a  >  0.5,  (46)  is  a  minimum-phase  transfer 
function,  otherwise  it  is  not.  Using  usual 
techniques  the  minimum-phase  case  can  be  stabilized 
with  an  output  feedback  gain  K.  For  a».8  and  K«5 
the  closed  loop  average  system's  poles  are  .i-1. 13 
and  S2“-10.1.  However,  the  logarithmic  norm  test 
applied  to  the  error  dynamics  gives  en  upper  bound 
equal  to  zero,  implying  that  the  error  dynamics  are 
not  unstable.  A  graph  of  the  (a.K)-stabilizability 
domain  and  two  phase-space  simulations  are  given  in 
Fig.  1  and  Fig.  2,  respectively,  to  illustrate  the 
stability  results  and  the  effect  of  the  feedback 
gain  K  on  the  dynamics  of  the  closed  loop  system. 

5.  Conclusion 

An  averaging  method  based  on  the  Bake r- Camp¬ 
bell  -Ha  usdorff  formula  was  introduced  for  computing 
the  average  of  a  hybrid  system.  In  order  to  be 
able  to  find  how  well  the  average  system  is 
approximating  the  actual  system  upper  bounds  of  the 
error  induced  by  averaging  are  given.  Furthermore, 
it  was  shown  that  under  certain  conditions  one  can 
avoid  the  averaging  of  the  A-matrix  and  therefore 
minimize  the  errors  introduced  by  the  averaging 
procedure.  This  is  done  via  state  feedback  by 
making  the  A-matrix  constant. 

The  controllability  property  of  the  average 
system  is  a  key  assumption  in  the  stabilization 
procedure  given  in  the  paper.  For  that  reason  a 
sufficient  condition  that  identifies  a  class  of 
hybrid  systems  for  which  the  average  is  controll¬ 
able  was  given.  Therefore,  the  class  of  hybrid 
systems  with  a  controllable  average  is  a  research 
topic  in  need  of  further  investigation. 

The  stability  of  hybrid  systems  is  still  far 
from  being  solved.  This  is  mainly  due  to  the  fact 
that  hybrid  systems  are  time-verying  systems.  In 
the  paper  a  sufficient  stability  condition  was 
derived.  This  condition  is  based  on  the  logarith¬ 
mic  norm  concept.  One  important  point  Co  be  inves¬ 
tigated  about  this  stability  condition  is  how 
conservative  it  is?  The  variable  structure 
property  seems  to  be  a  promising  feature  in  this 
direction.  Furthermore  if  one  thinks  of  every 
system  Ej“(A^,Bj ,C1)  with  lcM  as  an  operator  acting 
oe  the  state  x  during  <3t^,  and  these  operators  are 
applied  in  a  successive  manner,  then  this  process 
can  be  viewed  as  an  iterative  process  [15]. 
Viewing  a  hybrid  system  as  an  Iterative  process 
sheds  some  light  on  soma  complicated  issues  such  as 
the  stability  of  such  systems. 

Finally  adapting  the  results  of  this  paper  to 
hybrid  systems  where  the  switching  is  a  stochastic 
process  such  as  a  Harkov  chain  can  be  easely  done. 
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ABSTRACT 

This  paper  analyzes  the  errors  introduced  by  the 
averaging  of  hybrid  systems.  These  systems  involve 
linear  systems  which  can  take  a  number  of  different 
realizations  based  on  the  state  of  an  underlying  finite 
state  process.  The  averaging  technique  (based  on  a 
formula  from  Lie  algebras  known  as  the  Baker-Campbell- 
Hausdorff  (BCH)  formula)  provides  a  single  system 
matrix  as  an  approximation  to  the  hybrid  system.  The 
two  errors  discussed  are:  a)  The  error  induced  by  the 
truncation  of  the  BCH  series  expansion,  and  b)  The 
error  between  the  actual  hybrid  system  and  its  average. 
A  simple  sufficient  stability  test  is  proposed  to  check 
the  asymptotic  behavior  of  this  error.  In  addition, 
conditions  are  derived  that  allow  the  use  of  state 
feedback  to  arrive  at  a  time-invariant  system  matrix 
instead  of  averaging. 

1.  INTRODUCTION  AND  PROBLEM  FORMULATION 

Hybrid  systems  are  a  special  class  of  piece-wise 
constant  time-varying  systems.  Such  models  switch  at 
different  time  instants  among  a  finite  set  of  linear 
time- invariant  realizations.  Systems  of  this  type  can 
be  used  to  model  systems  subject  to  known  abrupt 
parameter  variations  such  as  synchronously  switched 
linear  systems  [1],  networks  with  periodically  varying 
switches  (2]  or  to  approximate  some  types  of  time- 
varying  systems  [3].  This  is  achieved  by  imposing  a 
deterministic  switching  rule  [4],  To  model  unknown 
abrupt  phenomena  such  as  systems  subject  to  failures 
[5],  the  switching  can  be  modeled,  for  example,  as  a 
finite-state  Markov  chain  (FSMC).  An  earlier  review  of 
hybrid  systems  may  be  found  in  [6]. 

Averaging  theory,  which  is  used  in  a  deterministic 
or  probabilistic  context,  is  an  approach  to  the 
approximation  of  such  systems  by  a  single  constant 
linear  model.  In  the  probabilistic  case  averaging  is 
introduced  in  a  natural  way  by  taking  expected  values. 
In  the  deterministic  case,  however,  averaging  is 
intr  '.ced  via  perturbation  techniques.  Brockett  and 
Wood  [2]  used  a  deterministic  averaging  technique  to 
analyze  and  stabilize  a  class  of  bilinear  systems  which 
are  very  hard  to  analyze  or  control  otherwise.  Geman 
(7)  used  probabilistic  averaging  techniques  to  study 
the  stability  of  random  differential  aquations.  His 
main  interest  was  to  explore  the  relation  between 
asymptotic  rtability  in  the  average  equation,  and 
asymptotic  stability  in  the  random  equation. 
Specifically,  when  does  the  first  imply  the  sacond? 
Kosut  at  al.  [8]  applied  the  theory  of  averaging  to  the 
analysis  of  the  stability  of  adaptive  systems.  Ezzina 
and  Haddad  (9]  used  an  averaging  technique  very  similar 
to  the  one  used  in  [2]  to  analyze  and  stabilize  hybrid 
systems  via  a  nonswitching  gain.  As  a  matter  of  fact, 
Marlton  et  al.  [10]  showed  that  nonswitching  control 
gains  may  be  preferable,  in  addition  to  the  fact  that 
chey  are  ouch  easier  to  implement. 

In  this  paper  the  averaging  procedure  used  in 


{2,9]  is  considered  further.  In  [2]  there  was  no 
attempt  to  analyze  the  errors  introduced  by  the  BCH 
formula  and  the  averaging  method.  However,  there  are 
two  very  important  issues  in  using  such  a  formula  and 
averages  derived  from  it  as  mentioned  in  (9].  The 
first  is  the  error  introduced  by  truncating  the  BCH 
expression  while  computing  the  average  matrix.  The 
second  is  the  difference  between  the  actual  system,  in 
[9]  the  F2-system,  or  in  [2]  the  bilinear  system,  and 
the  average  system  used  to  approximate  the  average 
behavior  of  the  system  under  consideration.  This  paper 
addresses  both  Issues  by  providing  bounds  on  the 
resulting  errors. 

Furthermore,  the  paper  provides  conditions  under 
which  the  BCH  formula  can  be  avoided.  Instead  of  using 
the  BCH  formula  to  compute  an  average  system  matrix, 
state  feedback  is  used  to  obtain  a  constant  closed  loop 
matrix  for  the  system. 

The  class  of  hybrid  systems  considered  in  this 
paper  are  assumed  to  have  the  form 

i(t)  -  A(r(t))x(t)  +  B(r(t))u(t)  (la) 

y(t)  -  C(r(t))x(t);  (lb) 

where  x  is  the  system  state  vector  of  dimension  n,  u  is 
the  control  input  vector  of  dimension  p,  y  is  the 
system  output  vector  of  dimension  m,  and  r(t)  is  the 
"form  index"  which  is  a  deterministic  scalar  sequence 
taking  values  in  the  finite  index  set  (f*(l,  2,  ...,  N). 
Let  the  ith  form  denote  the  realization  ^-(Aj.Bi.C^) 
associated  with  the  ith  form  index  (i.e.,  r(t)“i),  for 
IcN. 

It  is  assumed  that  any  r(t)  sequence  is  composed 
of  a  succession  of  N-termed  blocks.  Every  block  is  a 
permutation  of  the  index  set  N.  It  is  important  to 
note  that  the  succession  of  the  blocks  is  completely 
arbitrary  (e.g.,  for  N«3,  a  possible  r(t)-sequence  is: 
123,  321,  213,  213,  312,  ...).  The  time  interval 
during  which  r(t)”i  is  denoted  by  6t^.  In  this  case  we 
define 

N 

T  s  S  Jtj  (2) 
i-1 

as  the  period  of  the  system.  Piece-wise  constant 
periodic  systems  are  a  special  class  of  hybrid  systems. 
Therefore,  from  an  application  point  of  view  the 
subsequent  results  can,  at  least,  be  applied  to  the 
periodic  case.  However,  the  primary  motivation  is  to 
derive  results  that  can  be  applied  to  the  case  where 
the  switching  is  governed  by  a  FSMC. 

The  following  is  an  outline  of  Che  paper.  Section 
2  begins  with  an  overview  of  the  averaging  technique 
for  hybrid  systems.  It  also  addresses  the 
perturbations  induced  by  the  averaging  procedure.  Two 
important  perturbation  errors  are  identified,  and  the 
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first  one  is  analyzed.  Section  3  discusses  the  second 
error,  and  also  offers,  where  possible,  an  alternative 
to  averaging.  Section  4  concludes  the  paper  and  points 
to  additional  open  questions. 

2.  DERIVATION  OF  THE  PERTURBATION  BOUNDS 

This  section  addresses  the  accuracy  of  the 
averaging  technique.  First,  upper  bounds  for  the 
errors  introduced  by  using  a  truncated  BCH  formula  are 
derived.  . 

The  averaging  methodology  to  be  analyzed  in  this 
paper  is  based  on  a  formula  from  Lie  algebras  known  as 
the  Baker-Campbell-Hausdorf f  (BCH)  formula  (2,11,12): 
Given  two  real  matrices  A  and  B  there  is  no  guarantee 
that  there  exists  a  real  matrix  C  such  that 

Exp(A)Exp(B)  -  Exp(C).  (3) 

However,  if  IAl+IBISln(2),  then  C  exists  [11],  and  will 
be  given  by  a  convergent  infinite  expression  (the  BCH 
formula) 

C  -  A+B+( 1/ 12)[[A,B],B)+(l/l2)(tB,A],A]+...  (4) 

where  the  symbol  [A,B)saB-BA  (i.e.,  the  commutator 
product) . 

Similar  expressions  like  the  BCH  formula  are  used 
in  a  large  number  of  useful  approximations  in  physics 
[12]  and  switched  electrical  networks  (2).  In  the 
sequel  we  show  how  the  BCH  formula  and  related 
expressions  can  be  used  to  compute  the  average  of  In¬ 
form  hybrid  systems.  However,  for  notational 

simplicity  F2-systems  (i.e.,  two-form  hybrid  systems) 
only  are  treated. 

The  averaging  idea  introduced  in  [2]  and  used  in 
[9]  to  stabilize  hybrid  systems  is  outlined  in  the 
following  section.  Given  an  F2-system  such  that 

{A,  for  0  S  t  <  Ti, 

(5) 

A2  for  T[  S  t  <  T. 

then  an  approximation  for  Exp(A]Tj)Exp(A2(T-Tj)  is 
desired.  The  BCH  formula  provides  the  approximation  as 
Exp(CT),  where  C  is  as  in  (4).  Now  if  we  require  an 
expression  for  C  independent  of  the  order  of  the 
product,  to  agree  with  the  order  independence 
introduced  in  the  definition  of  F2-systems,  then  we 
must  compute 

lni(Exp(A)Exp(B)+Exp(B)Exp(A)} 

«A+B+(1/12)[(A,B),B]+(1/12)[[B,A],A] 

-A+B+(1/12)[[A,B),B-A].  (6) 

Therefore,  we  obtain  a  series  of  approximate 
averages  for  F2-systems.  If  the  F2-system  realization 
is  Ij-fAj.bj)  for  period  Tj,  and  l2"(A2>^2^  for  POrio<1 
T2,  then  the  first  order  approximation  is 

I  “  (oAj+( l-a)A2>  abj+( l-a)b2) ,  (7) 

with  a  ■  Tj/(Tj  +  T2).  Second-order  terms  may  be 
included  to  obtain  the  approximation 

I  -  ([eA1+(l-o)A2+(l/12)a(l-a)[[A1.A2), 

(l-ajAj-ttAj),  abj+( l-a)b2) .  (8) 

Higher-order  approximations  may  also  be  derived. 
Consequently,  if  we  let  C  denote  the  approximating 


matrix,  then  C  can  be  written  as  C“C+£,  where  £  is  the 
error  due  to  the  approximation.  Therefore,  the  induced 
error  in  computing  Exp(CT)  by  using  the  matrix  C  is 

E  i  Exp(CT)-Exp(CT).  (9) 

The  solution  formula  for  inhomogeneous 
differential  equations  can  be  used  to  derive  an  exact 
expression  of  E  [13]: 

T 

E  -  /  (Exp(C(T-s))£Exp((C+£)s)>ds.  (10) 

0 

In  this  section  a  useful  approximate  expression 
for  E  is  derived  using  perturbation  techniques.  To  do 
so  we  define  £5cCp,  where  c  is  a  scalar.  It  is 
recalled  [13]  that  ii  (£,£)  commute,  that  is, 

ee  -  c£,  (in 

then 

Exp((£+cCp)T)  -  Exp(CT)Exp(cCpT) 

»  Exp(CT)(I  +  cCpT  +  c2C2T2/21  +...).  (12) 

To  find  Exp((£+C)T),  when  £  and  Cp  do  not  commute,  one 
can  use  an  iterative  technique  similar  to  the  one  used 
to  derive  the  exact  expression  for  E  [13).  Hence  one 
can  write 

Exp((C+C)T)  -  Exp(CT)  + 

T 

cExp(CT)  /  Exp(-£s)C_Exp(£s)ds  +  0(c2).  (13) 

0 

This  is  the  Liouville-Neumann  series  solution  for  the 
integral  equation.  It  is  a  convergent  perturbation 
series  for  all  c  [13). 

However,  these  exact  expressions,  given  as  a 
series  in  c,  do  not  lend  much  insight  to  qualitative 
analysis  questions.  In  that  regard,  transforming  those 
integral  expressions  for  E  into  differential  equations 
might  be  more  useful.  We  first  introduce  the  following 
definition: 

Ej  :  E  ■  0( n2) 

T 

•  cExp(CT)  /  Exp(-Cs)C_Exp(Cs)ds,  (14) 

0 

where  Ej  is  the  first  order  approximation  to  E  in  c. 
Nov  Ej  can  be  expressed  as  follows. 

Proposition  1  [14] 

Let  Ej  denote  the  first  order  approximation  in  c 
to  E,  then  Ej  satisfies  the  following  matrix 
differential  equation: 

Y  -  CY  +  c>CpExp(£t) ,  Y(0)  -  0.  (15) 

As  a  consequence  of  the  above  representation  one 
can  use  the  theory  of  linear  matrix  differential 
equations  to  study  the  qualitative  behavior  of  E,.  For 
example,  one  can  show  that  if  all  eigenvalues  of  C  have 
negative  real  parts  then  Ej(t)  «  0  as  t  *  •- 

Moreover,  it  is  possible  to  derive  a  general 
explicit  expression  for  Ej .  To  do  #o  we  first  recall 
the  well  known  result  [12] 
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Exp(sA)BExp(-sA)  -  E  (s*/i! ,  (16) 

i-0 

with  (A°,B}“B  and  [An+1 ,B}-[ A, {An,B} ] .  Using  this 
identity  the  general  explicit  expression  for  £j  will 
follow 

-E,  -  eExp(Ct)  E  (ti/il){(-C)i'1,  Cp).  (1?) 
i-1 

At  this  point,  we  are  ready  to  compute  upper 
bounds  for  E}  and  E. 

Proposition  2 

Assume  that  IExp(Ct)#£M(t)Exp(8(t)t),  with  0(t)  a 
scalar  function,  then 

IE1IS(lCl/2lCl)M(t)Exp{3(t)t}(Exp{2lClt}-l),  (18) 

and  with  the  added  assumption  of  M(t)  being  monotone 
then 


IEI  £  tlClM5(t)Exp{(0(t)  +  M(t)lCl)t). 

(19) 

Proof 

Using  (16)  in  the  evaluation  of  the 
(14)  yields 

integral  in 

Ei  -  Exp(-At)  Z  (ti/i!)(Ai‘1,B). 
i*l 

(20) 

Taking  the  norm  of  both  sides  in  (20)  and  using  the 
following  inequality 

1 { An , B } 1  £  2n  IAnl  IBI  -  l2Aln  IBU 

(21) 

leads  to 

lEi*  £  IExp(-At)l  E  (ti/i!)l2Ali'1IBI, 
i-1 

(22) 

which  after  simple  algebra  results  in  (18) 
(19)  follows  in  the  same  manner  by 

.  Equation 
using  the 

monotonicity  property  of  M(t). 

3.  STABILITY  OP  THE  ERKOR  DYNAMICS 

In  this  section  the  dynamics  of  the  error  between 
the  average  system  and  the  actual  hybrid  system  are 
derived.  The  stability  of  the  error  dynamics  is 
discussed  and  two  stability  criteria  are  introduced. 

Civen  a  homogeneous  N-Form  hybrid  system  with 
state  vector  x(t),  and  one  of  its  time- invariant 
averages  with  state  vector  xa(t),  the  error  is  given  as 

e(t)  *  x(t)  -  xa(t)  (23) 

From  this  definition  it  is  easy  to  see  that  the  error 
dynamics  are  governed  by  the  following  hybrid  system 

N 

e(t)  «  (  E  Vj(t)  6Ai)e(t)  (24) 

i-1 

where  SAjiA^-A,  and  the  Indicator  functions  v^(t)  are 
defined  by:  vl(t)-l  when  the  original  hybrid  system  is 
described  by  the  ith  realization  E^,  and  v1(t)-0 
otherwise. 

At  this  point  two  stability  criteria  are 
introduced  to  check  the  stability  of  (24).  The  first 


criterion  is  a  necessary  and  sufficient  condition  [4] 
and  the  second  one  is  a  sufficient  test  only  (9,  4). 
Because  the  first  condition  is  computationally  involved 
and  is  a  generalization  of  a  well  known  result  (see 
(4))  we  choose  to  present  the  second  one. 
Interestingly  enough,  the  second  test  is  more  general 
in  the  sense  that  it  can  be  easily  generalized  to  a 
larger  class  of  hybrid  systems. 

In  order  to  state  this  sufficient  condition  a 
brief  introduction  to  the  notion  of  logarithmic  norm  is 
given. 

The  logarithmic  norm  (also  known  as  the 
logarithmic  derivative,  the  measure  of  a  matrix)  was 
introduced  in  1958  separately  by  Dahlquist  [15)  and 
Loz inski j  [16]  as  a  tool  to  study  the  growth  of 
solutions  to  ordinary  differential  equations  and  the 
error  growth  in  discretization  methods  for  their 
approximate  solution.  It  is  formally  defined  as 
follows : 

Definition 

The  logarithmic  norm  associated  with  the  matrix 
norm  l.l  is  defined  by 

u(A)  -  lim  (II  +  hAI  -  l)/h  (25) 

h-*0+ 

Explicit  expression  for  the  logarithmic  norm  associated 
with  the  Euclidean  norm  is 

u( A)  -  max(u  :  u  c  \( (A+A*)/2) } ,  (26) 

where  A(A)  is  the  set  of  eigenvalues  corresponding  to 
the  matrix  A.  Then  the  following  inequality  is  true: 

IExp(At)l  £  Exp ( u ( A ) t ) .  (27) 

Theorem  3 

For  the  null  solution  of  the  hybrid  system  (24)  to 
be  uniformly  asymptotically  stable,  it  is  necessary 
that 

Z  u(-Ai)pi  >  0,  (28a) 

i 

and  sufficient  to  have 

E  u(Ai)pi  <  0,  (28b) 

i 

where 

Pi  5  (ti-ti_i)/T,  ieN. 

Proof 

We  start  by  showing  that  the  sufficient  condition 
holds.  Using  Theorem  27  in  [17]  one  can  write 

t 

le(t)l  £  le(to)lExp(  /  u(A(s))ds) 

£0 

t  N 

-  Ie(t0)l  Exp(  /  u[  E  vj( t  )Aj  Jds  ).  (29) 

t0  i“l 

Using  Theorem  5(e,  d)  in  [17]  and  after  some  algebra  we 
get 

N  t 

limle(t)l  £  limle(tQ)l  Exp{  E  /  vt( t)ul A| Jds) 

t—  t—  i-l  t0 
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(35) 


N 

•  limle(t0)l  Exp{  (  Z  Pjul Aj ] )(t-t0))  (30) 

t—  i-1 

with 

t 

Pi  -  Um(l/(t-t()))  /  vj(*)ds.  (31) 

t—  t0 

which  completes  the  proof  of  (28b). 

The  necessary  condition  (28a)  is  shown  similarly 
by  using  the  fact  that  Exp-(u(-A))SIExpAtl . 

This  simple  sufficient  condition  states  that  for  a 
hybrid  system  to  be  uniformly  asymptotically  stable  the 
weighted  average  of  the  logarithmic  norms  of  each 
realization  has  to  be  negative.  Therefore,  this 
sufficient  condition  allows  for  unstable  forms.  That 
is,  as  long  as  the  stable  forms  dominate  the  overal 
system  is  asymptotically  stable.  This  domination  can 
occur  in  three  ways:  either  the  stable  forms  are 

strongly  stable  (i.e.,  highly  negative  logarithmic 
norms)  or  the  time  span  of  the  stable  forms  is  large 
relative  to  the  time  span  of  the  unstable  ones  or  a 
combination  of  both  reasons.  This  stability  property 
of  hybrid  systems  was  reported  in  [5]  via  examples. 

The  difference  between  (28b)  and  (28a)  could  be 
used  as  a  measure  of  the  conservativeness  of  (28b). 

Sometimes  it  is  possible  for  F2-systems  to  avoid 
the  computation  of  an  average  matrix  and,  consequently, 
minimize  the  errors  induced  by  averaging  [9].  That  is, 
under  certain  conditions,  it  is  possible,  via  state 
feedback,  to  make  the  closed  loop  F2-system  A-matrix 
time -invariant.  That  is,  given  Ij  and  Zj  satisfying 
appropriate  conditions,  one  can  compute  a  feedback  gain 
matrix 

K  -  [Kj  J  K2J  (32) 

which  will  make  the  A-matrices  of  both  forms  equal,  so 
that  the  A-matrix  of  the  closed  loop  hybrid  system 
(using  gain  Kj  for  Ej)  becomes  a  constant.  Moreover, 
this  constant  A  matrix  is  given  by  the  following 
equation 

A  «  Aj  -  BjKj  -  A2  *  »2k2-  (33) 

In  [18],  Marlton  proposed  a  technique  quite 
similar  to  this  idea.  He  showed  that  it  is  possible  to 
solve  the  Jump  Linear  Quadratic  (JLQ)  problem  by  making 
the  performance  index  independent  of  the  different 
realizations  of  the  form- index  r(t).  His  approach 
renders  the  cost  incurred  by  any  realization  of  r(t) 
the  same.  In  other  words,  it  makes  all  realizations 
equal  in  that  sense. 

Even  though  the  goals  seem  similar,  the  approaches 
are  not.  In  contrast  to  [18],  the  present  equalization 
is  direct;  the  homogeneous  parts  of  the  two  forms  are 
made  eoual  via  feedback.  In  this  section  a  sufficient 
condition  is  given  for  the  existence  of  K  and  a  simple 
computational  algorithm  based  on  the  Kronecker-product 
and  the  generelized-inverse  techniques  is  proposed.  As 
stated  above,  the  following  results  hold  for  N»2  only. 

Theorem  A 

Given  the  F2-aystem 

d  ■  AjX  +  BjU,  i  •  1,2.  (34) 


such  that 

RangelAj-Aj  Bj  ]  -B2]  “  Range[Bi  |  -B2], 
then,  there  exists  a  minimum-norm  G=[Kj  J  K2]  such  that 

A1  ‘  B1K1  “  a2  *  b2k2-  (36) 

Moreover,  the  G  matrix  is  given  by 

s(G)  -  {[Bj  '  -B2]+  8  In)s( A j -A2 ) ,  (37) 

where  s(.)  is  the  stacking  operator,  ( . )+  is  the 
generelized-inverse,  and  0  is  the  Kronecker-product. 

Proof 

Starting  with  the  forms  Z j,  i«l,  2,  the  1th  model 
is  given  by 

x(t)  “  AjX  +  BjU.  (38) 

If  we  define  A  s  Aj-6Aj  the  above  system  can  be  written 
as  follows 

x(t)  •  Ax  +  6AjX  +  BjU,  (39) 

The  next  step  is  to  compute  a  gain  matrix  Kj  such  that 

6A  jX  +  BjU  “  4AjX  -  BjKjX 

-  (6Aj  -  BiKi)x  -0  for  all  x.  (40) 

This  is  equivalent  to 

BjKj  -  6Aj,  (41) 

which  is  an  algebraic  equation  for  the  unknown  Kj . 
Substructing  the  first  equation  from  the  second  yields 

Aj  -  A2  »  BjKj  -  B2K2  (42) 

which  can  be  written  as  follows 

[Aj  -  A2]  -  [Bj , -B2] [KJ ,KJ] '  (43) 

which  is  itself  an  algebraic  equation  with  Kj ,  i*l,  2, 
as  unknown  and  the  condition  given  in  the  theorem  is 
the  one  needed  for  the  existence  of  both  gains. 
Equation  (37)  is  a  compact  way  to  write  these 
aquations,  and  is  also  useful  when  numerical  techniques 
are  used  to  solve  the'problem. 

Corol larv 

If 

Range[Aj-A2  |  Bj-B2]  »  Range[B1-B2] ,  (44) 

then 

K  *  Kj  -  K2,  (45) 

and  the  gain  matrix  is  given  by: 

s(K)  -  UB1-B2]+  0  In)s(Aj-A2).  (46) 

To  illustrate  the  above  results  consider  the 
following  example. 

Example 

Given  the  following  F2-systam 
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2 

2 

1 

A1  “ 

0 

1 

.  bx  - 

0 

l 

0 

0 

a2  “ 

1 

-1 

.  b2  - 

-1 

. 

. 

. 

it  is  easy  to  check  that  after  closing  the  loop  via  the 
gain  K»(l  2),  the  system  A-matrix  becomes  the  identity 
for  i-1,  2. 

4.  CONCLUSION 

In  this  paper  the  errors  introduced  by  averaging 
hybrid  systems  have  been  analyzed.  Two  errors  have 
been  Identified  and  discussed.  The  first  is  the  error 
induced  by  trucating  the  BCH  expression  while  computing 
the  average  A-matrix.  The  second  is  the  error  between 
the  actual  hybrid  system  and  its  average.  The  paper 
provides  two  upper  bounds  for  the  first  error  along 
with  a  simple  sufficient  stabilTty  criterion  for  the 
second  one. 

It  is  important  to  note  that  the  results  of  this 
paper  are  independent  of  the  averaging  technique  used 
to  compute  the  average  system.  Thus,  the  results  can 
be  used,  for  instance,  when  the  switching  is  governed 
by  a  stochastic  process  such  as  a  Markov  chain  and  the 
average  system  is  the  probabilistic  average  of  the 
hybrid  system.  However,  the  stability  tests  used  in 
the  paper  have  to  be  adjusted  accordingly.  As  a  matter 
of  fact,  imposing  some  ergodicity  conditions  on  the 
dynamics  of  the  hybrid  system  yields  exactly  the  same 
sufficient  stability  condition  when  properly 
interpreted. 

A  different  direction  that  might  be  helpful  in 
deriving  better  stability  conditions  for  such  systems 
is  to  think  of  every  system  Si»(A^,Bi,Cj)  with  ieN  as 
an  operator  acting  on  the  state  x  during  4tiP  and  these 
operators  are  applied  in  a  successive  manner,  then  this 
process  can  be  viewed  as  an  iterative  process  [19]. 
Viewing  a  hybrid  system  as  an  iterative  process  sheds 
some  light  on  the  complexity  of  the  stability  of  such 
systems. 

It  would  be  interesting  to  compare  the  present 
work  and  [9]  to  [20]  where  a  different  type  of 
averaging  hybrid  systems  is  discussed. 
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ABSTRACT 

The  stabilization  of  hybrid  systems  with  a  non- 
svitching  gain  is  cheaper  and  simpler  to  implement 
than  the  switching  one.  One  approach  to  the  design 
of  a  non-switching  gain  is  based  on  the  averaging 
of  the  hybrid  system.  For  obvious  reasons,  the 
non-switching  gain  exists  if  the  average  system  is 
controllable.  In  this  paper,  the  minimality  of  the 
average  system  is  investigated  and  a  sufficient 
criterion  is  derived.  Furthermore,  these  results 
also  shed  some  light  on  Che  topology  of  minimal  LTl 
systems  in  parameter  space. 


nonswitching  control  gains  uy  be  preferable  in 
addition  to  the  fact  that  they  are  much  easier  to 
Implement. 

The  paper  also  considers  deterministic  hybrid 
systems,  when  such  systems  capture  the  essence  of 
stochastic  hybrid  systems.  In  this  case  the 
addition  to  the  fact  that  they  are  much  easier  to 
implement. 

The  hybrid  systems  considered  in  this  paper  are 
assumed  to  have  the  form 


1.  INTRODUCTION  AND  PROBLEM  FORMULATION 
Hybrid  systems  are  a. special  class  of  piece-wise 
constant  time-varying  systems.  Such  models  switch 
at  different  time  instants  among  a  finite  set  of 
linear  time- Invariant  realizations.  Systems  of 
this  type  can  be  used  to  model  systems  subject  to 
known  abrupt  parameter  variations  such  as 
synchronously  switched  linear  systems^,  networks 
with  periodically  varying  switches2  or  to 
approximate  some  types  of  time-varying  systems3. 
This  is  achieved  by  imposing  a  deterministic 
switching  rule2*.  However,  to  model  unknown  abrupt 
phenomena  such  as  systems  subject  to  failures^  the 
switching  can  be  modeled,  for  example,  as  a  finite- 
state  Markov  chain  (FS MC).  An  earlier  review  of 
hybrid  systems  may  be  found  in  Sworder's  paper6. 


x(t)  »  A(r(t))x(t)  +  B(r(t))u(t)  (1.1) 

y(t)  »  C(r(t))x(t)  (1.2) 

where  x  is  the  system  state  vector  of  dimension  n 
u  is  the  control  input  vector  of  dimension  p,  y 
the  output  vector  of  dimension  m,  and  r(t)  is  the 
"form  index"  which  is  either  a  deterministic  or  a 
stochastic  scalar  sequence  taking  values  in  the 
finite  index  set  IN{1,  2,  ...,  N}. 

The  system  takes  the  realization  Sj“(Aj ,Bj ,Cj) 
when  r(t)  «  i,  with  icN.  This  realization  is 
called  the  ith  form.  6tj  denotes  the  time 
interval  during  which  r(t)  -  i.  In  addition  we 
define 


Averaging  theory,  which  is  used  in  a  deterministic 
or  probabilistic  context,  is  an  approach  to  the 
approximation  of  such  systems  by  a  single  constant 
linear  model.  In  the  probabilistic  case  averaging 
is  introduced  in  a  natural  way  by  taking  expected 
values.  In  the  deterministic  case,  however, 

averaging  is  introduced  via  perturbation 
6echniques.  Brockett  and  Wood2  used  a 

deterministic  averaging  technique  to  analyze  and 
stabilize  a  class  of  bilinear  systems  which  are 
very  hard  to  analyze  or  control  otherwise.  Geman2 
used  probabilistic  averaging  techniques  to  study 
the  stability  of  random  differential  equations. 
His  main  interest  was  to  explore  the  relation 
between  asymptotic  stability  in  the  average 

equation,  and  asymptotic  stability  in  the  random 
equation.  "Specifically,  when  doe*  the  first  Imply 
the  second?  Kosut  et  al . 2  applied  the  theory  of 
averaging  to  the  analysis  of  the  stability  of 
adaptive  systems.  Ezzine  and  Haddad’  used  an 

averaging  technique  very  similar  to  the  one  used  in 
Brockett  et  al.'s  paper2  to  analyze  and  stabilize 
hybrid  systems  via  a  nonswitching  gain.  As  a 
matter  of  fact,  Mariton  et  al.^  showed  that 


N 

T  i  E  dtj  (2) 

i-1 

as  the  period  of  the  system. 

Sometimes  it  is  more  convenient  to  represent  the 
hybrid  system  in  an  equivalent  different  fora 
which  leads  to  the  following  representation 

N  N 

x(t)  -  {  l  vi(t)Ai)x(t)  +  {  Z  vj( t )Bj }u( t )( 3. 1 ) 
i-1  i-1 

N 

y(t)  -  (  l  vi(t)Ci)x(t)  (3.2) 

1-1 

where  Vj(t)  «  1  when  the  system  is  governed  by  the 
1th  realization  and  v^(t)  -  0  otherwise.  The 
v^(t)  function  is  called  the  1th  Indicator 
function.  It  is  evident  from  the  definition  of 

hybrid  systems  that  at  any  point  in  time  only  one 

of  the  N  indicator  functions  takes  the  value  one. 
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This  paper  mostly  addresses  the  case  where  where 
r(t)  Is  a  stochastic  process,  which  is  assumed  to 
be  governed  by  a  Finite  State  Markov  Chain  (FSMC) 
with  probabilities 

Pr{r(t+x)  -  J|r(t)  -  1}  -  Plj(x)>  (4a) 

for  continuous-time  systems.  In  case  the  dynamics 
of  the  hybrid  system  are  discrete  the  transition 
probabilities  are  givin  by 

Pr {r( t+1 )  -  J  |  r(  t)  -  1}  -  _  (42) 


It  is  also  assumed.  Throughout  this  research  work, 


unless  stated  otherwise 
stationary  and  irreducible 


II 


that  the  FSMC  is 


sequence  r(t)  is  assumed  to  be  deterministic  and  to 
be  composed  of  a  succession  of  N-termed  blocks. 
Every  block  is  a  permutation  of  the  index  set  M. 
It  is  important  to  note  that  the  succession  of  the 
blocks  is  completely  arbitrary  (e.g.,  for  N"3,  a 
possible  r( t) -sequence  is: 
123,321,213,213,312,...). 

The  following  is  an  outline  of  the  paper.  Section 
2  begins  with  an  overview  of  the  averaging 
techniques  for  hybrid  systems.  In  section  3  the 
controllability  of  hybrid  systems  is  recalled  along 
with  a  necessary  contollabillty  condition  and  a 
stabilization  result.  Section  4  deals  with  the 
controllability  of  the  average  of  a  hybrid  system. 
This  result  is  used  to  derive  the  main  theorem  of 
the  paper,  which  identifies  a  class  of  hybrid 
systems  for  which  the  average  is  minimal.  Section 
5  concludes  the  paper  and  points  to  additional  open 
questions. 


2.  THE  AVERAGE  SYSTHi 

Two  averaging  techniques  are  concidered  in  this 
paper.  The  first  one  is  the  probabilistic  average 
of  the  hybrid  system  when  the  switching  is  governed 
by  an  irreducible  FSMC.  The  second  one  is  the 
first  order  average  of  the  hybrid  system  when  the 
switching  Is  deterministic  as  discussed  above.  The 
latter  average  is  based  on  the  Backar-Campbell- 
Hausdorff  formula2. 

The  two  averages  are  Identical  in  form.  In  fact, 
both'  averages  are  weighted  averages.  In  the 
probabilistic  case  the  weights  are  the  components 
of  the  stationary  probability  vector  of  the  FSMC. 
In  the  deterministic  case  the  weights  are  the 
relative  time-spans  spent  by  each  form.  Therefore, 
the  following  representation,  of  the  average,  is 
adapted  for  both  cases  in  the  rest  of  the  paper 


3.  COWTSOLLABUJTY  OF  HYTOTn  SYSTEMS 
Before  dealing  with  the  controllability  of  the 
average  system  a  definition  of  the  controllability 

of  deterministic  hybrid  system s4  is  proposed  along 
with  few  related  results. 

Definition: 

A  deterministic  hybrid  system  Is  said  to  be  state- 
controllable  if  for  any  t0  each  state  x(t„)  can  be 

transferred  to  any  final  state  Xf  after  one 
period.  Thus  there  exists  a  tf,  t„+TStf<«  such 
that  x(tf)«xf. 

The  next  result  is  a  necessary  algebraic 
controllability  condition.  Basically  the  theorem 
says  that  in  order  for  the  hybrid  system  to  be 
controllable  it  is  necessary  that  the  sum  of  the 
controllable  subspaces  of  the  forms  to  be  equal  to 
the  whole  space. 

Theorem* 

A  necessary  algebraic  condition  for  a 
deterministic  hybrid  system  to  be  controllable  is 

ranktC^  Qj.  ••••  Cm ]  -  rank  C  «  n.  (8) 

Where  is  the  controllability  matrix  for  the  ith 
form,  icM. 

Due  to  the  importance  of  this  theorem  a  heuristic 
proof  presented  in  Ezzine  et  al.s  paper4  that 
shows  that  the  above  condition  is  almost 
sufficient  will  be  repeated  here.  The  heuristic 
argument  can  be  given  as  follows:  Since  any  matrix 
exponential  is  a  perturbation  of  the  identity 
matrix  it  follows  that  multiplying  any  matrix  with 
matrix  exponentials  will  not  change  its  range 
space  drastically.  That  is  if,  for  example,  Cj 
and  Q2  have  algebraic  complementary  range  spaces 
(i.e,  range(Ci)  is  perpendicular  to  rangei^)) 
then  range(Exp(AT)Ci )  will  almost  always  remain  an 
algebraic  complement  but  not  necessarely 
perpendicular  to  range^)-  As  a  matter  of  fact, 
Mariton*2  states  that  he  has  proved  that  theorem  6 
is  also  a  sufficient  condition  when  the  switching 
is  governed  by  a  continuous  FSMC. 

Using  the  above  definition  it  is  possible  to  prove 
in  a  classical  way4  that  deterministic  hybrid 
systems  are  uniformly  completely  controllable  iff 
they  are  controllable.  Therefore,  the  above 
algebraic  condition  plays  almost  exactly  the  same 
role  as  the  usual  sigebraic  condition  for  LTI 
systems. 


Ea  *  aiEi  +  a2^"2  ■*■•••  +  aN^N*  (5) 

where  the  a^zO  are  the  steady  state  probabilities 
of  r(t)«i  for  the  stochastic  case,  and  are  defined 

as 

°i  “  8ti/T  (6) 

for  the  deterministic  case  with 
N 

T  -  Z  6tt  (7) 


At  this  point  and  in  the  light  of  the  preceding 
paragraphs  we  would  lika  to  mention  the  work  of 
tkeda  et  al.^2.  In  their  work  they  looked  at  the 
relation  between  controllability  properties  of  the 
system  end  various  degrees  of  stability  of  the 
closed  loop  system  resulting  from  linear  feedback 
of  the  state  variables.  Their  results  are  as 
follows:  For  any  Initial  time  t0,  end  any 
continuous  and  monotonically  nondecreasing 
function  £(.,t,)  such  that  .  4(t0,t,)»0,  the 
transition  matrix  <>(.,.)  of  the  closed  loop  system 
can  be  made  such  that  |4( t , t„)|Sa( t0 )Exp{ -6( t , t0 )) 
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for  *11  *  tit,,  iff  th«  aystam  1*  completely 
controllable.  Furthermore,  In  c**e  of  a  bounded 
aystam,  for  any  mSO,  a  bounded  feedback  matrix  can 
be  found  auch  that  l^(t],t1)iSa£xp{*m(t]'t1)}  for 
all  tj,  tjit,,  iff  the  ayetam  la  uniformly 
completely  controllable.  Thus,  their  reaulta  can 
be  regarded,  in  some  sense,  as  extensions  of  the 
well  known  results  of  closed  loop  pole  assignment 
for  time-invariant  systems. 

Hence,  there  is  a  high  degree  of  flexibility  in  the 
stabilization  of  hybrid  systems  if  they  are 
controllable  or,  equivalently,  uniformly  completely 
controllable. 

As  an  illustration  of  the  above  results  we  recall, 
with  a  slight  generalization,  a  stabilization 
theorem4  for  hybrid  systems  where  averaging  is 
used. 

Definition 

A  hybrid  system  is  almost  surely  stabilizable  if 
there  exists  a  constant  feedback  gain  matrix  X  such 
that  the  closed  loop  hybrid  system  is 
asymptotically  stable. 

Theorem4 

A  hybrid  system  is  almost  surely  stabilizable  if 

a.  The  average  system  is  stabilizable, 

b.  The  following  inequality  is  almost  surely 
satisfied 

N 

£  uldAi  -  SBjKJpi  <  0,  (9) 

i-1 

where  X  is  s  stabilizing  gain  matrix  of  the 
average  system,  6T^a(6Ai,  SBj)  is  the  difference 
between  the  1th  realization  and  the  average 
system,  and  u(.)  is  the  logarithmic  norm  of 
(-)14. 

A.  COWTBOLLABILm  OF  THE  AVERAGE  SYSTKH 
We  now  turn  to  the  controllability  of  the  average 
system  in  light  of  the  above  theorem.  One  of  the 
key  assumptions  made  to  design  the  regulator  via 
averaging  is  the  controllability  of  the  average 
system.  This  assumption  is  not  unreasonable  since 
the  controllability  property  of  linear  time 
invariant  systems  is  generic.  However,  one  can 
construct  hybrid  systems  such  that  their  averages 
are  not  controllable;  such  a  system  is  a  2-form 
hybrid  systen  (72-system): 

£j  “  (Aj,bj)  and  £3  *  ^ A2 ’ b2^ 

wi  th 


-1  1 

1 

A1  “ 

•  bl  ' 

0  0 

1 

• 

0  1 

1 

a2  - 

•  >>2  " 

0  1 

0 

.  i 

.  . 

It  is  easy  to  chack  that  neither  £3  nor  £2  is 


controllable  but  that  th*  72-system  is  .  After 
derivation  of  the  average  system  th*  determinant 
of  its  controllability  matrix  aa  a  function  of  a 
was  computed.  The  latter  determinant  la  given  by 
the  following  third  order  polynomial: 

P(a)  •  a(2a2  -  3a  +  1 ) . 

The  zeros  of  the  above  polynomial  are  aj«0.  . 

and  03". 5.  The  two  first  zeros  aj  and  02  are  a 
consequence  of  the  fact  that  Ej  and  E2  are  both 
uncontrollable.  However,  the  zero  03-. 5  is  a 
result  of  the  averaging,  therefore  it  refutes  the 
claim  that  the  average  system  of  a  controllable 
hybrid  system  is  necessarily  controllable. 

In  th  sequel  we  give  a  sufficient  condition  that 
identifies  a  class  of  hybrid  systems  for  which  the 
hybrid  system's  controllability  guaranties  the 
controllability  of  the  average  system. 

Theorem  1 

The  average  system  of  a  N-form  hybrid  system  is 
controllable  if 

a-  rank  (Cj,  Q2,  ....  Cfj)  “  b>  where  Cj  is  the 
controllability  matrix  of  the  ith  form, 

b-  All  forms  are  simultaneously  diagonalizable , 

Proof : 

The  average  system  £-(A,B)  is  given  by 

N  N 

A  “  E  OjAj ,  and  B  -  E  o^B^  (10) 

i-1  i-1 

with 

N 

E  a,  -  1,  and  03  *  0  for  i»  1,  2,  ..,  N.  (11) 

i-1 

Since  all  forms,  i-1,  2 .  N,  are 

diagonalizable  with  the  same  similarity 
transformation  then 

T_1AiT  «  Ai(  (12) 

T^Bj  -  Tj  for  i-1,  2 . N.  (13) 

The  T  matrix  is  the  conaon  modal  matrix  for  all 
the  forms,  and  A3  is  the  diagonal  matrix 
corresponding  to  the  i-th  form. 

Using  the  above  result  the  average  system  can  be 
transformed  to  the  following  form 

A  *  T_IAT  -  T"l(Ej  OjA^T  -  £3  0^3,  (14) 

and 

T  *  T_1B  -  T'kEj  a^Bj )  -  £3  0^.  (15) 

Because  of  assumption  (b)  A  will  be  diagonal  too. 
Now,  invoking  condition  (a)  every  tow  in  the  r 
matrix  must  have  at  least  one  nonzero  entry^ 
which  concludes  the  proof. 

Using  theorem  1  and  the  duality  principle  it  is 
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easy  Co  derive  Che  following  theorem. 


Theorem  2 

The  average  system  of  a  N-form  hybrid  sysCem  is 
minimal  provided  Chat 


a-  rank[Cj,  C2*  ••••  Ctf!  “  n>  where  is  Che 
controllability  matrix  of  the  ith  form, 


b- 


rank 


»n 


n. 


where  Oi  is  Che  observability  matrix  of  Che  ith 
form. 


c-  All  forms  are  simultaniously  diagonalizable. 


Using  this  Cheorea  it  is  easy  to  design  a  constant 
regulator  for  hybrid  systems,  for  which  the  above 
three  conditions  hold,  using  standard  LTl-design 
techniques.  For  an  example  the  interested  reader 
is  referred  to  Ezzine  et  al.  's  work^. 


5. CONCLUSION 


The  main  result  presented  in  this  paper  is  a 
theorem  that  identifies  a  class  of  hybrid  systems 
for  which  the  average  is  minimal.  The  minimality 
of  the  average  system  is  crucial  if  the  hybrid 
system  is  to  be  stabilized  via  a  nonswitching 
feedback  gain.  Furthermore,  this  result  sheds  some 
light  on  the  topology  of  minimal  LTX  systems  in 
parameter  space. 


The  above  theorem  can,  probably,  be  generalized  to 
the  case  where  all  forms  are  simultaniously 
transformable  to  Jordan  canonical  forms  with  the 
additional  condition  that  the  geometric 
multiplicities  of  all  eigenvalues  of  the  avarage 
system  are  equal  to  one. 


It  is,  again,  obvious  that  Che  necessary  part  of 
the  controllability  criterion  of  the  hybrid  -ystem 
plays  an  important  role  in  the  controllability  of 
the  average  system.  This  need  for  this  condition 
makes  it  still  more  important  to  be  studied 
closely. 


The  issue  of  the  minimality  of  average  systems  can 
be  studied  in  a  more  systematic  way  by  defining  an 
appropriate  topology  on  the  parameters  space:  works 


such  as  Cobb's  paper*-**,  Eising's  paper*-^,  and 


references  therein  can  provide  a  good  start  in  this 
direction.  Nevertheless,  the  paper's  results  point 
to  directions  that  can  be  helpful  in  defining  such 
topology.  Moreover,  Che  last  theorem  tells  us  that 
given  a  set  of  simultaniously  diagonalizable 
systems  then  every  element  in  the  set  of  all 
averages,  as  defined  in  this  paper,  is 
controllable.  This  is  an  interesting  result  in  its 
own  right. 


Another  research  direction  concerning  the 
minimality  of  the  average  system  is  the  application 
of  the  results  in  Anderson's  paper*®  where 
structural  controllability  and  matrix  nets  are 
studied . 
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ABSTRACT 

This  paper  develops  techniques  for  the 
analysis,  control  and  stabilization  of  hybrid 
systems.  These  systems  svitch  among  a  finite  set 
of  linear  time- invariant  models  with  switching 
behavior  governed  by  a  Finite  State  Markov  Chain. 
The  relationship  of  these  techniques  to  standard 
methodologies  for  linear  time -invariant  systems 
is  also  considered. 

1.  INTRODUCTION  AND  PROBLEM  FORMULATION 
1.1  INTRODUCTION 

Many  real  systems  such  as  power  systems  (1- 
3]  exhibit  variations  in  their  structures  or 
abrupt  changes  in  their  inputs  or  internal 
variables  and  other  system  parameters.  Standard 
linear  time- invariant  systems  models  can  not 
adequatly  represent  these  systems.  Consequently, 
a  new  class  of  systems  has  been  proposed  to  model 
such  systems  (4-14/ .  This  class  is  called  hybrid 
systems  due  to  the  existence  of  both  discrete  and 
continuous  variables  in  their  state  space.  Such 
systems  have  been  explicitly  or  implicitly  used 
in  past  research  [1/. 

Systems  of  this  type  can  be  used  to  model 
networks  with  periodically  varying  switches 
(15,16),  synchronously  switched  linear  systems 
[17],  multi-rate  Sampled-data  systems  [18-22], 
systems  subject  to  failures  [1,23-25], 
manufacturing  systems  [24,25],  large  scale 
flexible  structures  [13],  and  last  but  not  least 
macroeconomic  models  [5]. 

The  objective  of  this  paper  is  to  develop 
methodologies  or,  at  least,  to  provide  the 
necessary  foundations  for  the  analysis  and  design 
of  such  systems,  with  emphasis  on  their 
stabilization.  These  design  tools  are  expected 
to  aid  in  the  design  of  controllers  to  stabilize 
such  systems  and  to  achieve  reliable  performance 
despite  the  changes. 

It  is  customary  to  assume  that  these  systems 
switch  among  a  finite  set  of  linear  models 
according  to  an  irreducible  FSMC.  Hence,  the 
approach  is  based  on  the  mathematical  theory  of 
ergodic  stochastic  processes.  Interestingly 
enough,  the  concepts  of  eigenvalues  and 
eigenspaces  are  generalizable  within  the  ergodic 
theory  framework.  Therefore,  the  key  design  idea 
of  eigenvalues  assignment  for  hybrid  systems 
remains  meaningful  despite  the  time-variation  and 
random  nature  of  these  systems. 

Earlier  major  works  on  the  subject  exhibit 
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two  important  points.  The  first  one  is  the 
almost  exclusive  usage  of  Stochastic  Dynamic 
Programming  (SDP)  as  a  tool  to  adress  the  optimal 
control  and  stabilization  of  hybrid  systems.  The 
second  one  is  the  difficulties  related  to  the 
solution  (i.e.,  existence  and  uniqueness  issues) 
of  the  coupled  Riccati-like  equations  derived  via 
SDP.  These  equations  play  the  same  role  played 
by  the  familiar  Riccati  equations  in  LQR  theory. 

SDP  is  a  very  convenient  tool  to  solve 
optimization  problems.  However,  besides  the  well 
known  "curse  of  dimentionality" ,  the  systematic 
application  of  SDP  does  not  allow  the  user  to 
gain  insight  about  this  complex  problem.  In 
general  the  use  of  SDP  obscures  most  of  the 
useful  properties  of  these  systems.  These 
properties  can  reveal  the  geometric  and  algebraic 
structures  of  hybrid  systems.  As  a  matter  of 
fact,  many  of  the  results  presented  in  [26], 
[14],  and  [27]  show  that  these  systems,  despite 
their  time-variation  and  random  nature,  share 
many  useful  properties  with  LTI  systems. 

Our  approach  avoids  the  difficulties  faced 
by  previous  researchers  and  exploits  the 
similarities  between  LTI  systems  and  hybrid 
systems.  In  particular,  due  to  the  simplicity 
and  success  of  the  eigenvalues  assignment  design 
technique  in  the  stabization  of  LTI  systems,  we 
will  follow  a  similar  approach. 

In  order  to  address  the  stabilization 
problem  of  hybrid  systems  one  needs  a  simple 
stability  criterion  for  this  class  of  systems. 
Hence,  some  conditions  for  determining  the 
stability  of  hybrid  systems  are  first  developed 
based  on  a  generalization  of  the  eigenvalues 
concept.  The  stabilization  approach  uses  the 
largest  Lyapunov  exponent  along  with  some 
controllability  properties  of  hybrid  systems. 

The  following  is  an  outline  of  the  paper. 
After  the  problem  formulation.  Section  2 
introduces  the  material  needed  to  discuss  the 
almost  sure  stabilization  of  hybrid  systems. 
This  section  also  addresses  tne  stability  of  both 
continuous  and  discrete  time  hybrid  systems  and 
simple  sufficient  stability  criteria  are  derived. 
In  section  3  the  almost  sure  stabilizability 
result  is  discussed.  This  result  is  a 
generalization  of  Wonham  stabilization  theorem  to 
this  class  of  systems.  Section  4  concludes  the 
paper. 

1.2  PROBLEM  FORMULATION 

The  hybrid  systems  considered  in  this  paper 
are  assumed  to  have  the  form 
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x(t)  »  A(r(t))x(t)  +  B(r(t))u(t)  (l.a) 

y(t)  -  C(r<t))x(t)  (l.b) 

where  x  is  Che  system  state  vector  of  dimension 
n,  u  is  the  control  input  vector  of  dimension  p, 
y  is  the  output  vector  of  dimension  m,  and  r(t) 
is  the  "form  index"  which  is  either  a 
deterministic  or  a  stochastic  scalar  sequence 
taking  values  in  the  finite  index  set  J*«{1>  2, 
....  N>. 

The  system  takes  the  realization 
Ei«(Ai,B|,Ci)  when  r(t)  -  i,  with  icN.  This 
realization  is  called  the  ith  form.  Let  <5t^ 
denote  the  time  interval  during  which  r(t)  -  1. 
In  addition  we  define 
N 

T  i  I  «tt  (2) 

i-1 

as  the  period  of  the  system. 

Sometimes  it  is  more  convenient  to  represent 
the  hybrid  system  in  an  equivalent  different  form 
which  leads  to  the  following  representation 

N  N 

x(t)  -  {  Z  vi(t)Ai}x(t)  +  {  Z  v1(t)Bi)u(fi)a) 
i-1  i-1 

N 

y(t)  -  {  £  vi(t)Ci}x(t)  (3.b) 

i-1 

where  vt(t)  -  1  when  the  system  is  governed  by 
the  ith  realization  1^,  and  v^it)  »  0  otherwise. 
The  Vj(t)  function  is  called  the  1th  indicator 
function.  It  is  evident  from  the  definition  of 
hybrid  systems  that  at  any  point  in  time  only  one 
of  the  N  indicator  functions  takes  the  value  one. 

Most  of  our  work  will  address  the  situation 
where  r(t)  is  a  stochastic  process,  more 

presisely,  r(t)  will  be  assumed  to  be  a  Finite 

State  Markov  Chaine  (FSMC)  with  transition 
probabilities 

PfrCt+dti)  -  j | r( t )  -  i)  -  pij,  (4. a) 

for  continuous -time  systems,  and 

P{r(t+-1 )  -  j  |  r(  t)  -  i}  -  p.j,  (4 .  b) 

for  discrete-time  systems.  It  is  also  assumed, 
unless  stated  otherwise,  that  the  FSMC  is 
stationary  and  irreducible  [28], 

Sometimes  r(t)  is  defined  as  a  special 
deterministic  process.  It  will  be  obvious  from 
the  definition  that  r(t)  is  very  similar  to  the 

FSMC  defined  above.  This  is  done  in  order  to 

show  that  a  deterministic  formulation  is 
sufficient  to  answe-  certain  questions.  In  this 
case  it  is  assumed  that  any  r(t)  sequence  is 
composed  of  a  succession  of  N-termed  blocks, 
where  every  block  is  a  permutation  of  the  index 
set  N.  It  is  important  to  note  that  the 
succession  of  the  blocks  is  completely  arbitrary 
(e.g.,  for  N-3,  a  possible  r(tl -sequence  is: 
123,321,213.213.312,...). 

2.  STABILITY  OF  HYBRID  SYSTEMS 

2 . 1  Lyapunov  Exponents 

In  1892,  A.  M.  Lyapunov  founded  the  theory 


of  characteristic  exponents  that  bear  his  name 
[29], 

His  intention  was  to  determine  criteria  for 
the  stability  (of  the  origine  x=0)  of 

x  »  A(t)x,  x( 0;  x0)  -  x0  c  Rn,  t  c  R+,  (5) 

A(t)is  continuous  and  bounded. 

For  constant  A  the  eigenvalues  of  A 
determine  the  stability  behavior  of  (S).  For 
periodic  A(t)  Floquet  theory  shows  that  the 
results  for  constant  A  remain  true  if  the  real 
parts  of  eigenvalues  are  replaced  by  the 
characteristic  exponents  of  A(t)  [30,  chapters  3 
and  13). 

The  Lyapunov  exponent  of  a  solution  x(t;  x3) 
is  defined  by 

A(x0)  i  lim  sup  (l/t)Loglx(f ;  x0)l.  (6) 

t— 

Lyapunov  proved  that  for  every  solution  with 
x„*0,  A(x0)  is  finite.  Moreover,  the  set  of  all 
possible  numbers  which  are  Lyapunov  exponents  of 
some  nonzero  solution  of  (5)  is  finite,  with 
cardinality  p,  such  that  ISpSn  and  A_<  ...  <  A}. 

Furthermore,  Lyapunov  proved  that  the 
subspaces 

Lt  *  (x,  c  Rn  :  AtxoJSAi,  i-1 .  p+1} 

form  a  filtration  of  Rn,  1.*., 

o  -  Vi  S  •••  L1-R"’ 

with  dimLj-kj,  such  that 

kp+l  “  0  <  kp  <  ...  <  kj  -  n, 

and 

X(x0)  -  iff  x0  c  Li\Li+1,  1  -  1,  ....  p. 

The  numbers  A^  together  with  their 
multiplicities  d^  are  called  the  Lyapunov 
spectrum  of  (5).  The  asymptotic  behavior  of  (5) 
is  dictated  by  Aj .  That  is,  (5)  is  exponentially 
stable  iff  A}<0. 

Unfortunatly ,  it  is  in  general  not  true  that 
Aj<0  implies  the  stability  of  the  following 
nonlinear  system: 

x  -  A(t)x  +  f(t,  x).  (7) 

However,  for  a  special  class  of  f(t,  x)  the  above 
is  true  if  (5)  is  what  Lyapunov  calls  regular . 
For  regular  systems  the  following  holds 

A^  -  lim  (l/t)Loglx(t;  x,)l.  (8) 

For  example,  (5)  is  regular  if  A  is  constant  or 
periodic.  In  the  latter  case  A^  are  the 
characteristic  exponents. 

Regularity  is  hard  to  verify  for  a 
particular  system,  but  it  happens  with 
probability  one  in  many  cases  involving  a  flow 
with  an  invariant  probability  measure  [31).  This 
is  how  Birkoff's  ergodic  theorem  [32]  and  ergodic 
theory  in  general  comes  in  to  exploit  Lyapunov 
powerful  spectral  theory. 

As  shown  above  there  are  many  similarities 
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between  eigenvalues  and  eigenspaces  of  constant 
matrices  and  the  Lyapuno  spectrax  theory  of 
time-varying  systems  -  The  main  similarity  that 
will  be  of  major  importance  in  this  paper  is  the 
stability  role  played  by  the  largest  Lyapunov 
exponent  A^;  A^  plays  the  same  role  as  the 
largest  eigenvalue  plays  in  the  stability  of 
time- invariant  systems. 

2.2  Continuous-time  Hybrid  Systems 

Eventhough  the  sign  of  A^  is  a  necessary  and 
sufficient  test  for  the  stability  of  a  hybrid 
system,  it  is  almost  impossible  to  compute.  In 
this  section  we  will  use  the  Lyapunov  exponent 
along  with  the  logarithmic  norm  concept  to  derive 
a  simple  sufficient  stability  test  for 
continuous -time  hybrid  systems. 

In  order  to  derive  uncomplicated  conditions 
for  the  stability  of  such  systems,  a  different 
tool  is  used,  namely  the  logarithmic  norm 
(33,34),  resulting  in  a  simpler  sufficient 
condition. 

The  logarithmic  norm  (also  known  as  the 
logarithmic  derivative,  the  measure  of  a  matrix) 
was  introduced  in  1958  separately  by  Dahlquist 
[33]  and  Lozinskij  [34]  as  a  tool  to  study  the 
growth  of  solutions  to  ordinary  differential 
equations  and  the  error  growth  in  discretization 
methods  for  their  approximate  solution.  It  is 
formally  defined  as  follows: 

Definition  1 

The  logarithmic  norm  of  a  matrix  A 
associated  with  the  matrix  norm  l.l  is  defined  by 

u(A)  -  lim  (II  +  hAI  -  l)/h  (9) 

h*0+ 

Expliclt  expression  for  the  logarithmic  norm 
associated  with  the  Euclidean  norm  is 

u(A)  »  maxiu  :  u  c  A( (A+A*)/2)} .  (10) 

Then  the  following  inequality  is  true: 

Exp(-u(-A)t)  4  IExp(At)l  4  Exp(u(A)t).  (11) 

One  very  important  property  of  the 
logarithmic  norm  follows  from  the  fact  that  it 
may  be  shown  to  be  the  smallest  element  of 

S  “  (s  :  IExp(At)l  4  Exp(st),  tkO).  (12) 

Therefore  it  gives  an  optimal  bound  on  the 
exponential  behavior  of  IExp(At)l  for  t£0.  It  may 
be  concluded  therefor  that 

suplExp(At)l  "  1  iff  u(A)S0.  (13) 

tiQ 

In  the  case  where  A  is  normal  square  matrix 
(i.e.,  A*A«AA*),  then 

IExp(At)l  ■  Exp(a(A)t)  ”  Exp(u(A)t)( 14) 

where  a(A)  is  the  maximal  real  part  of  the 
eigenvalues  of  A.  This  norm  is  now  used  to 
derive  the  stability  condition. 

Theorem  1 

For  the  null  solution  of  the  hybrid  system 
(1)  to  be  a.s.  exponentially  stable,  it  is 


necessary  that 

Z  u(-Ai)pi  >  0,  (15. a) 

i 

and  sufficient  to  have 

Z  u(Ai)pi  <  0,  (15. b) 

i 

where  the  Pj's  are  the  steady-state  probabilities 
of  the  irreducible  FSMC  and  icN. 

The  simple  sufficient  condition  states  that 

for  a  hybrid  system  to  be  a.s.  uniformly 

asymptotically  stable  the  average  of  the 

logarithmic  norms  of  each  realization  has  to  be 
negative.  Therefore,  this  sufficient  condition 
allows  for  unstable  forms.  That  is,  as  long  as 
the  stable  forms  dominate,  the  overal  system  is 
a.s.  exponentially  stable.  This  domination  can 
occure  in  two  ways:  either  the  stable  forms  are 
strongly  stable  (i.e.,  highly  negative 
logarithmic  norms)  or  the  time  span  of  the  stable 
forms  is  large  relative  to  the  time  span  of  the 
unstable  ones  or  a  combination  of  both  reasons. 
This  stability  feature  of  hybrid  systems  was 
reported  in  [23]  via  examples.  The  critera  given 
above  holds  for  hybrid  systems  where  the 
switching  is  deterministic  as  well,  in  which  case 
the  Pi's  represent  relative  time  span  of  the 
forms . 

It  is  clear  from  the  proof  of  the  above 
theorem  that  the  sufficient  almost  sure 
exponential  stability  criterion  is  an  upper  bound 
for  the  largest  Lyapunov  exponent  Aj,  and  is 
simple  to  compute.  Therefore,  this  sufficient 
criterion  can  play  a  beneficial  role  in  testing 
for  the  stability  of  hybrid  systems  as  well  as  in 
the  design  of  controllers  to  stabilize  such 
systems,  as  stated  in  the  following: 

Theorem  2 

The  largest  Lyapumov  exponent  Aj  of  the  null 
solution  of  the  hybrid  system  (1)  satisfies  the 
following  inequality 

A|  -  lim  (l/t)Loglx(t;  x0)l  4  Z  piu(Ai),  (16) 
t-*-  icH. 

where  the  p^'s  are  the  steady-state  probabilities 
of  the  irreducible  FSMC. 

The  difference  between  the  upper  and  lower 
bounds  given  in  theorem  1  give  a  measure  of  the 
conservativeness  of  (15. a).  This  problem  is 
considered  further  in  the  sequel. 

2.3  Discrete-time  Hybrid  Systems 

The  study  of  the  stability,  sample-wise,  of 
discrete-time  hybrid  systems  is  similar  to  the 
study  of  the  solutions  of  stochastic  linear 
difference  equations  with  randomly  varying 
parameters.  This  leed  us  to  the  study  of  the 
following  problem: 

Let  (M,,,  n  c  N)  be  a  sequence  of  random, 
nxn,  matrices.  To  each  x0  c  Rn  one  associates 
the  process  (X,,,  n  e  N)  with  values  in  Rn,  which 
is  the  solution  Co 

Xn+1  -  M^X,,,  n  c  N,  and  X„  -  x„.  (17) 

We  have  “  Mn...MiXa.  One  important 

question  is  what  is  the  asymptotic  behavior  of 
this  process. 
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Furstenberg-Kesten  Theorem  [351 

Let  (M^  i  c  N)  be  a  stationary,  metrically 
transitive  (i.e.,  ergodic)  stochastic  process 
with  values  in  the  set  of  rum  matrices  such  that 
E{Log+IMQj }<».  Then,  with  probability  one 

lim( l/n)E(LoglSnl }  -  inf ( l/n)E{!oglSnl } 
n  n 

”  -  lim( l/n)LoglSnl  2  Xx  c  RU{— },  (18) 

n 

where  Sn  2  X^  is  the  largest  Lyapunov 

exponent  of  the  process. 

Remark 

The  Furstenberg-Kesten  Theorem  (FKT)  is  a 
generalization  of  Birkoff's  ergodic  theorem  to 
the  case  of  matrix  valued  functions. 

The  largest  Lyapunov  exponent  Xj  of  a 
discrete-time  hybrid  system  plays  the  same  role 
as  the  one  played  by  the  largest  Lyapunov 
exponent  of  continous-time  hybrid  systems.  That 
is,  for  a  discrete-time  hybrid  system  to  be  a.s. 
uniformly  asymptotically  stable  it  is  necessary 
and  sufficient  to  have  X^<0.  However,  it  would 
be  very  impractical  to  use  (18)  to  compute  Xj 
(i.e.,  compute  (18)).  Therefore,  one  way  to 
avoid  this  difficulty  is  to  use  the  Furstenberg- 
Kesten  Theorem  to  derive  simpler  criteria  to  test 
for  the  a.s.  stability  of  the  system. 

As  it  is  stated  in  the  FKT,  X^  is  an  infimum 
of  a  particular  set.  Consequently,  if  any  of  the 
elements  of  this  set  is  negative  one  concludes, 
using  the  property  of  the  infimum,  that  the 
system  is  a.s.  exponentially  stable.  However,  by 
not  knowing  exactly  X^,  it  is  not  possible  to 
tell  how  stable  the  system  is.  That  is,  by 
exploiting  this  property  to  alleviate  the 
computational  burden,  we  are  loosing  some 
qualitative  insight  about  the  dynamical  behavior 
of  the  system.  This  qualitative  insight  can  be 
crucial  for  application  purposes.  First  we  need 
additional  definitions. 

Definition  2 

A  set  S"{H^,  i»l . N)  of  nxn  matrices  is 

nilpotent  provided  there  is  a  a  k-termed  sequence 
(Hi),  such  that  k  is  finite  and  the  matrix 

nk  s  Hk  ...  H2H!  (19) 

is  nilpotent.  The  least  number  6  for  which  the 
power  of  the  matrix  nk  is  null  is  called  the 
index  of  nilpotency. 

In  case  S  is  a  singleton  the  above 
definition  is  identical  to  the  usual  definition 
of  nilpotency. 

Proposition  1 

If  S  contains  a  nilpotent  matrix  then  the 
set  is  nilpotent  and  its  index  of  nilpotency  0  is 
less  or  equal  to  the  index  of  nilpotency  of  the 
nilpotent  matrix. 

Definition  3 

The  set  S“(Hit  i«l ,  ....  N>  of  nxn  matrices 
is  said  to  be  contractive  provided  that  there  is 
a  k-termed  finite  sequence  {Hj)  and  a  norm,  such 
that 

IHk  ...  HjHj I  S  a  <  1.  (20) 


At  this  point  we  would  like  to  recall  a 
theorem  that  will  play  a  key  role  in  the  sequel. 
This  theorem  is  the  converse  of  a  well  known 
result  relating  the  norm  of  a  matrix  to  its 
spectral  radius  (i.e.,  a(A)  i  IAI).  The 
following  theorem  asserts  that  there  exists  an 
induced  norm  for  which  the  Inequality  in  the 
previous  result  can  be  reversed  after  adding  an 
arbitrarily  small  positive  number  to  the  matrix 
spectral  radius. 

Theorem  (361 

For  any  e>0  and  any  nxn  real  A  matrix,  there 
is  a  (vector)  norm  on  Rn  such  that  the 
corresponding  induced  norm  satisfies 

IAI  S  a(A)  +  c.  (21) 

Now  we  are  ready  to  apply  the  above 
definitions  and  theorems  to  derive  simple 
sufficient  a.s.  exponential  stability  tests  for 
discrete-time  hybrid  systems.  As  mentioned 
earlier  these  results  will,  in  general,  answer 
the  stability  issue  but  will  not  provide  enough 
information  about  the  qualitative  behavior  of  the 
system.  That  is,  how  fast  or  how  slow  the  system 
is  converging  or  diverging.  However,  the  first 
result  is  an  exact  result  (i.e.,  exact  X^).  It 
allows  a  complete  quantitative  and  qualitative 
analysis  in  an  important  special  case. 

Theorem  3 

A  homogeneous  N-form  hybrid  system  with  a 
stationary  irreducible  FSMC,  is  a.s. 
exponentially  stable  with  X^*— ,  provided  that 
the  set  N  contains  a  nilpotent  set. 

The  first  result  says  that  a  homogeneous  N- 
form  hybrid  system  is  a.s.  stable,  and  its  Xj»— , 
if  the  set  of  N-matrices  of  the  hybrid  system 
contains  a  nilpotent  set. 

This  result  is  the  analog  of  the  stability 
result  of  a  homogeneous  difference  equation  with 
a  nilpotent  matrix.  These  difference  equations 
converge  to  zero  in  no  more  than  n  steps.  That 
is,  by  analogy,  hybrid  systems  with  a  nilpotent 
set  of  matrices  are  very  fast  systems.  This  is 
confirmed  by  the  fact  that  X]*-**. 

The  next  result  is  more  general  but  less 
powerful  in  the  sense  that  it  does  not  provide  us 
with  Xj. 

Theorem  4 

A  homogeneous  N-form  hybrid  system  with  a 
stationary  irreducible  FSMC,  is  a.s. 
exponentially  stable,  provided  that  the  set  N 
contains  a  contractive  set. 

As  an  attempt  to  alleviate  the  shortcomings 
of  the  latter  theorem  we  provide  an  upper  bound 
for  1}  similar  to  the  one  given  for  continuous- 
time  hybrid  systems.  However,  this  bound  does 
not  involve  the  logarithmic  norm  concept,  but  it 
is  derived  via  a  simple  computation. 

Theorem  5 

The  LSR  of  a  homogeneous  N-form  hybrid 
system  with  a  stationary  irreducible  FSMC 
satisfies  the  following  inequality 

N 

Xj  S  s  p^LoglA^I .  (22) 

i"l 

The  next  result  is  based  on  the  work  of  Katz 
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and  Thomas i an  [37].  Along  with  tha  FKT  it 
provides  a  mean  by  which  one  can  estimate  the 
least  number  of  matrix  multiplications  in  the  Ai* 
formula  for  which  the  probability  of  having  a 
large  error  is  minimized. 

Theorem  6 

Given  a  homogeneous  N-form  hybrid  system 
with  a  stationary  and  irreducible  FSMC,  a 
positive  integer  m  and  e> 0.  Then 

N 

P{|(l/n)  £  LoglAj^l  -  LSR*|ie  for  some  n£m; 
i-1 

£  2aExp{-0c^m) ,  (23) 

where 

(3  -  p3N/2852N2,  (24. a) 

a  -  8N/pN(l  -  Exp{ -0e2} ) ,  (24. b) 

with 

6  ”  maxLoglA^I  -  minLoglAj I , (25. a) 
icN  jeN 

LSR+  -  £  piLoglAil.  (25. b) 

icH 

and  p  is  the  largest  entry  in  the  steady-state 
probability  vector. 

This  result  tells  us  when  to  stop  the  time 
average  of  the  upper  bound  and  still  get  a 
good  approximation.  This  might  seem  useless 
since  we  have  a  simple  expression  for  LSR+. 
However,  the  LSR+  is  an  upper  bound  for  A^, 
therefore,  this  stopping  rule  can  be  used  as  an 
approximate  stopping  rule  in  computing  the  LSR. 

3.  ALMOST  SURE  STABILIZATION  OF  HYBRID  SYSTEMS 

3.1  Lvapunov  SPECTRAL  RADIUS  ASSIGNMENT 

In  this  section  we  provide  the  main  result 
of  the  paper;  a  simple  sufficient  a.s. 
stabilization  theorem. 

Theorem  7 

An  m  inputs  n  states  N-form  hybrid  system 
with  a  stationary  and  irreducible  FSMC  is  a.s. 
stabillzable  by  arbitrarily  assigning  its  A} 
provided  that  there  is  a  completely  reachable  K- 
periodic  system,  with  m  inputs  and  n  states, 
embedded  in  the  N  forms,  with  KSN. 

This  result  is  based  on  the  pole-assignment 
for  discrete-time  linear  periodic  system. 
Hernandez  and  Urbano  [38]  extended  the  pole 
assignment  technique  to  Linear  periodic  systems. 
Their  result  is  used  here  to  extend  the  pole 
assignment  to  stochastic  hybrid  systems  by 
assigning  the  largest  Lyapunov  exponent  Xj. 

It  is  possible  under  special  assumptions  to 
arbitrarily  assign  Aj  without  requiring  the 
complete  reachability  of  a  periodic  system.  That 
is,  by  Imposing  a  geometric  relation  among  the  N 
forms.  This  way  the  complete  reachability 
condition  is  weakened  considerably  as  it  is 
stated  in  tha  next  theorem 

Theorem  8 

Given  a  N-form  hybrid  system  with  a 
stationary  and  irreducible  FSMC  such  that 

a.  Rank  (Clt  C2,  ....  CNJ  -  n, 

b.  (A^-B^K^,  icN>  belongs  to  a  solvable  Lie 
algebra, 

then  tha  hybrid  system  can  be  made  exponentially 
stable  with  Aj»»«  a.s. 


Actually  what  the  theorem  says  is  that  the 
Lyapunov  spectral  radius  can  always  be  made 
negative,  more  precisely  if  the  two 

conditions  of  the  theorem  are  met.  The  first 
condition,  as  discussed  several  times  in  [26],  is 
a  necessary  condition  for  the  hybrid  system  to  be 
controllable,  therefore,  it  is  the  weakest 
controllability  condition.  This  condition  is  the 
strongest  part  of  this  theorem.  However,  the 
second  condition  is  quite  restrictive  and  maybe 
impractical. 

4.  CONCLUSION 

The  a.s.  exponential  stability  criteria 
presented  in  this  paper  are  simple  to  compute, 
consequently  they  alleviate  the  computational 
shortcomings  of  Lyapunov  exponents.  However, 
these  tests  are  only  sufficient  and  they  can  be 
quite  conservative,  hence  they  require  further 
study. 

The  a.s.  stabilizability  theorem  can  be 
viewed  as  a  generalization  of  Uonham 
stabilization  theorem.  Actually  what  the  theorem 
says  is  that  the  Lyapunov  spectral  radius  can 
always  be  made  negative  with  probability  one, 
more  precisely  if  certain  conditions  are  met. 

One  additional  problem  of  interest  is  to 
find  wether  the  eigenspaces  idea  carries  over  to 
hybrid  systems  and  its  usefulness. 
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ABSTRACT 

This  paper  analyzes  piecewise-linear  systems 
which  are  singularly  perturbed.  A  technique  is 
developed  that  allows  decoupling  of  such  systems 
into  fast  and  slow  subsystems  for  analysis  and 
design.  The  results  of  a  numerical  example  are 
included  to  demonstrate  this  technique. 


1.  INTRODUCTION 

Piecewise-linear  systems  which  are  singularly 
perturbed  are  found  in  many  applications  includ¬ 
ing  electrical  circuits  and  flight  controls.  The 
piecewise-linearity  may  be  due  to  nonlinear 
elements  such  as  saturation  or  may  result  from  a 
linearization  about  various  operating  points  of  a 
nonlinear  plant.  These  types  of  systems  are 
numerically  very  stiff  and,  hence,  are  difficult 
to  analyze.  This  problem  may  be  alleviated  by 
using  singular  perturbation  theory  to  separate 
the  system  into  reduced-order  models,  one 
containing  the  slow  dynamics  and  one  containing 
the  fast  dynamics.  Reduced-order  models  are 
easier  to  use  in  analysis  and  design  by  lessening 
the  computation  complexity.  In  addition,  time- 
integration  of  the  lower  order  systems  instead  of 
the  full  order  model  reduces  computation  time 
since  a  larger  time  step  can  be  used  for  the  slow 
dynamic  model.  The  use  of  standard  singular 
perturbation  techniques,  however,  requires  that 
the  system  dynamical  equations  be  smooth  [1,2] 
ruling  out  their  use  on  piecewise-linear  systems. 
This  paper  extends  the  general  method  of  singular 
perturbation  for  application  to  continuous 
piecewise-linear  systems. 


1.1  Problem  formulation 


The  system  considered  in  this  paper 
represented  in  the  following  form: 

may  be 

x  «  fjCx.z), 

x(t„)  -  x0 

(1) 

uz  *  fj(x,z). 

*(t0)  -  z„ 

(2) 

where:  ft  and  f2  are  continuous  piecewise-linear 


functions,  u>0  is  a  small  parameter,  and  xcRP  and 
zcRr.  The  functions  are  affine  in  specific 


regions  of  the  state  space  (R1^)  where  a  region 
is  typically  defined  as  an  intersection  of  half- 
spaces.  Por  example,  equations  (1)  and  (2)  are 


represented  in  the  it^>  region  by 

the  following 

"linear"  system: 

.  i  .  ,  i  .  i 

x  «  Alt  x  +  A12  z  +  Wj 

(3) 

.  i  ,  .  i  .  i 

uz  «  A21  x  +  A22  z  +  w2 

(4) 

Por  the  purposes  of  this  paper,  the  i1*1  region  is 
defined  by  the  set  Sj»{(x,z):  d^_j<  K^x+K^z  i  d^} 

where  K  and  K  are  row  vectors  and  d,  ,  <  d.  are 
x  z  i-1  i 

scalars.  By  this  definition,  the  type  of  regions 

allowed  are  parallel  in  that  the  boundaries  do 

not  intersect.  An  example  of  a  physical  system 

which  has  this  description  is  one  in  which  the 

piecewise-linear  element  is  in  a  scalar  feedback 

loop.  The  reason  for  the  restriction  will  be 

discussed  in  Section  2. 

The  system  given  in  equations  (1)  and  (2) 
contains  both  fast  and  slow  dynamics.  The 
variable  x  is  primarily  slow  while  z  has  both 
fast  and  slow  components.  Starting  from  the 
initial  conditions  of  equations  (1)  and  (2),  the 
fast  part  of  z  quickly  dies  out  and  z  converges 
to  a  quasi-steady-state  value  (i.e.,  the  slow 
component)  in  a  short  time  interval  [t0,t0+6) 
known  as  the  boundary  layer.  The  fast  component 
of  z  is  then  known  as  the  boundary  layer  solu¬ 
tion.  The  solution  of  the  system  outside  of  the 
boundary  layer  is  termed  the  outer  solution.  It 
is  desired  to  decouple  system  (l)-(2)  into  fast 
and  slow  models  which  yield  the  boundary  layer 
solution  and  the  outer  solution,  respectively. 
The  boundary  layer  solution  is  then  used  as  a 
correction  term  to  the  outer  solution  so  that  the 
combination  is  an  approximation  for  the  original 
system  with  errors  of  order  0(u)>  A  technique  to 
decouple  the  system  is  developed  in  this  paper. 

The  following  is  an  outline  of  the  paper. 
Section  2  discusses  the  boundary  layer  solution 
and  developes  a  reduced-order  model  to  approxi- 
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mate  this  solution.  The  outer  solution  along 
with  a  corresponding  reduced-order  model  is 
discussed  in  Section  3.  A  numerical  example  is 
presented  in  Section  4  to  demonstrate  the  tech¬ 
niques  developed  in  this  paper.  Section  5  con¬ 
cludes  the  paper. 

2.  BOUNDARY  LAYER  SOLUTION 

The  fast  dynamics  of  the  system  are  most 
prominent  during  the  boundary  layer  and  can  be 
decoupled  from  the  slow  dynamics  by  introducing 
an  expanded  time  scale  x  «  (t-t 0)/q.  Examina¬ 
tion  of  equation  u)  sh-ws  that  ..  stays  rela¬ 
tively  constant  with  respect  to  t  assuming  that 
A21^,  A,  2  *  and  w^  are  bounded  in  all  regions  Sf 
[ 1 ] .  Equation  (2)  may  be  rewritten  as  follows: 

-  f(£)  (5) 

where  z(x)«z(MX+t0)  and  f (z)“f 2(x0 ,z) .  The 

function  f  is  a  continuous  piecewise-linear 
mapping  from  Rr  into  Rr.  The  state  space  in  Rr 
is  partitioned  into  regions  where  the  function  is 
affine;  e.g.,  the  ic^  region  is  defined  as  the 
set  Rj«{z:  d^_ j  <  Kxx0+Kzz  £  d^}.  A  degenerate 

case  where  Kz»0  results  in  the  existence  of  only 

one  region  in  Rr  so  that  f  is  affine  everywhere. 
The  initial  quasi-steady-state  value,  zs(t„),  of 
z(t)  is  a  stable  equilibrium  point  of  (5).  Note 
that  the  equilibrium  point  of  the  degenerate  case 
is  easily  found. 

The  equilibrium  point(s)  of  (5)  for  the 
nondegenerate  case  can  be  found  using  solution 
techniques  developed  for  piecewise-linear  resis¬ 
tive  networks.  Many  papers  have  been  written  on 
finding  the  solution  x  of  the  equation  f(x)«y 
where  f  is  a  continuous  piecewise-linear  func¬ 
tion,  e.g.  (3-9).  Fujisawa  and  Kuh  show  in  [4] 
that  a  continuous  piecewise-linear  function 
satisfies  a  Lipshitz  condition.  The  following 
theorem  from  [4]  gives  sufficient  conditions  for 
the  existence  and  uniqueness  of  the  solution. 

Theorem  1 :  Let  f  be  a  continuous  piecewise- 

linear  mapping  of  Rr  into  itself  and  let 
denote  the  matrix  composed  of  the  first  k  rows 
and  columns  of  the  Jacobian  matrix  J1  in  region 
Rf.  The  mapping  is  a  homeomorphism  of  Rr  onto 
itself  if,  for  each  k«l ,2, . . . ,r,  the  determinants 
of  the  kxk  matrices 

•^k'^k'  •  •  •  >^rk 

do  not  vanish  and  have  the  same  sign. 

This  previous  work  is  used  in  finding  the 
equilibrium  point(s)  of  system  (5)  by  solving 
f(z)«0.  In  this  application,  J*  *  A22*  and  each 
A]]1  is  assuaied  to  be  Hurwitz  for  stability 
purposes.  The  conditions  of  Theorem  1  may  be 
stringent  and  various  other  sufficient  conditions 
for  the  existence  and  uniqueness  of  the  solution 
are  given  in  [9-11).  Also,  reference  [12] 
discusses  nonunique  solutions. 


2.1  Algorithm  to  Solve  for  Equilibrium  Point 

The  Katzenelson  algorithm  is  widely  used 
insolving  for  x  in  the  equation 

f(x)  »  y  (6) 

where  f:Rr-»Rr  is  continuous  and  piecewise- 
linear.  The  basic  outline  of  this  algorithm  used 
in  solving  f(z)"0  is  given  below.  More  details 
of  the  general  method  are  given  in  [4).  Let 

WJ  «  A21Jx0-KiJ  >j ,  and  denote  the  iteration 
number  on  zs  and  A  by  superscripts. 

0)  initialize  by  letting  i»l  and  zsi»z0 

1)  solve  z  ■  -(A22J)  wJ,  where  region  Rj 

contains  zs* 

2)  if  z  lies  in  region  Rj  then  zs  •  z  and  step 

3)  otherwise,  let  R^  be  the  region  containing  z; 
if  k>j  then  d  ■  d^  and  then  let  j“j+l 

if  k<j  then  d  •  ^j-l  an<*  t*ien  let  j“j-l 

4)  solve  A1  »  (Kzzs+Kxx0-d)/Kz(zsi-z) 

5)  solve  zsi+1  •  zs*  -  Ai(zsi-z) 

6)  let  i»i+l  and  go  to  1) 

It  is  shown  in  [4]  that  if  the  piecewise-linear 
function  is  a  homeomorphism  (e.g.,  it  satisfies 
the  conditions  of  Theorem  1)  then  the  algorithm 
will  converge  in  a  finite  number  of  steps. 

2.2  Boundary  Layer  Approximation 

A  fast  model  approximating  the  dynamics 
occurring  in  the  boundary  layer  can  be  found  once 
the  equilibrium  point  of  system  (5)  Is  known. 
The  boundary  layer  solution  is  then  given  as 
Zf(x)  »  z(x)-zs(t„).  In  this  application,  zs 

must  be  found  implicitly  because  f2  is  not 
smooth.  Therefore,  the  fast  model  approximating 
the  boundary  layer  solution  is  given  in  terms  of 
z.  In  the  i*"  region  the  fast  model  is  given  by 

■  Ajj1!,,  +  A221z  +  w,1,  z(0)»z0  (7) 

if (x)  -  z(x)  -  zs(t0) 

where  the  ith  region  is  defined  by  the  set  Rj«{z: 
di-l  <  Kx*«  +  V  S  di>- 

For  the  purposes  of  this  paper,  it  is  assumed 
that  there  exists  exactly  one  equilibrium  point 
which  is  asymptotically  stable.  Multiple  stable 
equilibrium  points  may  be  handled  by  partitioning 
the  state  space  into  domains  of  attraction  for 
the  various  equilibrium  points  and  the  analysis 
in  this  paper  holds  for  each  domain  of  attrac¬ 
tion. 

Asymptotic  stability  is  assumed  in  this  system 
though  there  is  no  known  general  method  for 
determining  asymptotic  stability  of  piecewise- 
linear  systems.  Depending  on  the  specific  system 
under  consideration,  a  Lyapunov  function  may  be 
found.  Another  possibility  is  to  use  standard 
SISO  frequency  domain  techniques  or  hyperstabil- 
lty.  For  using  hyperstability  notions,  system 
(5)  may  be  rewritten  as 
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—  “  Az  +  Bu  (8) 

QT 

where  A  is  chosen  to  be  stable,  B  is  the  identity 
I,  and  u  is  defined  in  the  i*-*1  region  to  be 

u«AA*z+A12*x0+w2*  where  AA*“A22*-A.  If  the 
nonlinearity  in  the  feedback  loop  satisfies  the 
Popov  integral  inequality,  then  the  necessary  and 
sufficient  condition  for  asymptotic  stability  is 

that  the  transfer  matrix  (sI-A)"*  must  be 

strictly  positive  real  [13]. 

The  errors  in  this  approximation,  which  are 
of  order  C,..),  era  due  to  the  substitution  of 
for  x  in  (7)  and  in  the  definition  of  the 
regions.  Substituting  x  «  x0  +  0(g)  in  (7)  and 
in  Hj  yields  the  system 

^  ■  A31*(x0+O(g))  +  Ajj*z  +  Wj1,  £(Q)“za  (9) 

Rt  «  {zs  di_1+0(u)  <  K^Xd+KjZ  i  di+0(g)) 

where  z  represents  the  actual  response.  In  the 
interior  of  any  particular  region,  both  the 
approximation  and  the  actual  model  are  linear. 
Previous  results  on  singular  perturbation  theory 
in  linear  systems  show  that  if  2(x‘ )"2(x' )+0(g) 
then  2(x")-£(x")+0(g)  for  r">x'  as  long  as  both  2 
and  £  stay  within  the  region.  The  problems  that 
may  arise  due  to  a  boundary  crossing  are  elimin¬ 
ated  if  the  class  of  system s  allowed  is  restrict¬ 
ed  to  those  in  which  the  vector  field  intersects 
a  boundary  hyperplane  at  a  large  enough  angle 
(i.e.  0(u°))-  In  these  systems  if  either  £  or  z 
crosses  into  another  region,  the  other  must  also 
cross  into  that  region.  The  resulting  error  in 
the  approximation  remains  of  order  0(g).  These 
conditions  are  summarized  in  the  following 
theorems.  Note  that  the  restriction  placed  on 
the  class  of  systems  is  sufficient  and  not 
necessary  for  proving  that  the  approximation 
error  is  of  order  0(g). 

Theorem  Z;  Let  the  vector  field  near  a 
boundary  at  dj«Kzz+Kxx0+O(g)  in  the  space  Rr  be 
given  by 

f(£)  “  A21*  (x0+0(u))  +  Ajj1  2  +  wji.  (10) 

Assume  that  f(£)  does  not  vanish  near  the 
boundary.  If  f(2)  intersects  the  boundary  with 
an  angle  of  order  0(p°),  then  the  difference 
between  the  solutions  of  (7)  and  (9)  is  0(g). 

Proof :  Assiune  z  crosses  the  d^  boundary  at 
t'  and  z  has  not  crossed  yet.  Prior  to  crossing 
£  •  z  +  0(g).  The  normal  vector  of  the  boundary 

hyperplane  is  given  by  n«Kz^/lKzl.  Since  f(z)*n 

“  0(g°),  then 

Kz(A21l  (*o+0(m))  +  Ajj1  2  +  Vj1)  •  0(g°) .  (11) 

It  follows  that 

Ks^Aji1  xo  +  Aji1  *  +  w2l)  “  0(g°)  (12) 

Define  s  and  i  by 

s  •  Kzz  -  di'  (13) 

a  -  Kzi  -  dt*  +  0(g)  (14) 


where  d^-di'K^,.  Assume  i,s>0.  For  2  to  cross 

the  boundary,  <0  where  ^  is  given  by  expres¬ 
sion  (11).  Correspondingly,  <  o  where  is 

ox  ax 

given  by  expression  (12).  At  the  boundary 
crossing,  s(x')»0  so  that  K2z-di '  »C(g) .  It 
follows  that  s(x')"0(g). 

Since  ^  "0(g°)  then  ^  ■  0(g°).  Hence,  Ax«0(g) 

since  As»s(x' )-0(g).  Therefore,  if  2  crosses  a 
boundary  into  a  new  region  at  x',  then  2  must 
also  cross  into  the  same  region  at  a  time  x"  such 
that  x""x'-K>'u). 

It  remains  to  be  shown  that  the  time  difference 
of  0(g)  in  the  boundary  crossing  has  0(g)  effect 
on  the  solution.  Let  A  -  A,,*  and  aA  »  A2jJ  -  A 
where^Rj  is  the  new  region  and  x0<x’  be  such  that 
both  £(x0)  and  2(x0)  lie  in  region  Ri.  Then  the 
solution  of  (7)  for  x>x'  is 

z(x)  -  *(x,x0)2(x0)  +|  *(x,o)( AAz+Aj j^Xo+Wj^ )do 

Jx' 

n'  i  i  (15) 

+  *(x,o)(A21  x0+w2  )  do 

where  4(x,x')  -  exp[A(x-x' ) ] .  Since  the  inte¬ 
grands  are  bounded  in  both  integrals  and  x"-x' 
■O(g),  equation  (15)  is  rewritten  as 

z(x)  ■  ♦(x,x0)z(x0)  +|  ♦(x,o)(AAz+A21^x0+w2-*)do 

Jx" 

rx"  j  ,  (16) 

+  4(x,o)(A21  x0+w2  )  do  +  0(g) 

Similarly,  the  solution  to  equation  (9)  is  found 
to  match  the  form  of  equation  (16)  exactly. 
Hence,  2(x)-2(x)+0(g) .  ■ 

Theorem  3:  Let  the  vector  field  near  a 

boundary  at  dj-KjpZ+KjjX,,  in  the  space  Rr  be  given 
by 

f(£)  -  A2  2 1  x  q  +  Aj,1  £  +  Wj1.  (17) 

Assume  that  f(£)  does  not  vanish  near  the. 
boundary.  If  f(£)  intersects  the  boundary  with 
an  angle  of  order  0(g°),  then  the  difference 
between  the  solutions  of  (7)  and  (9)  is  0(g). 

Proof:  The  proof  is  very  similar  to  that  of 

Theorem  2.  The  gist  of  the  proof  is  to  show  that 
if  z  crosses  the  boundary  prior  to  a  crossing  of 
2,  then  £  must  cross  within  a  time  of  order  0(g). 
The  time  delay  in  crossing  affects  the  error  in 
the  approximation  only  by  order  0(g). 

Using  the  results  of  Theorems  2  and  3  it  is 
seen  that  the  errors  in  the  approximation  are  of 
order  0(g).  The  restriction  given  in  Section  1 
that  the  regions  of  linearity  be  parallel  is  used 
in  the  proof  of  the  theorems  but  is  not  a  neces¬ 
sary  condition.  The  difficulty  is  showing  that 
if  a  solution  crosses  a  boundary  near  an  inter¬ 
section  of  boundaries  then  the  approximation  will 
remain  within  an  error  of  order  0(g). 
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3.  OUTER  SOLUTION 


A  reduced-order  model  for  system  (5)-(2)  is 
developed  below  with  approximation  errors  of 
order  0(d)  for  the  time  outside  of  the  boundary 
layer.  Assuming  that  the  fast  subsystem  given  in 
equation  (7)  is  asymptotically  stable  to  its 
equilibrium  point,  the  fast  component  of  z  is 
negligible  outside  of  the  boundary  layer. 
Therefore,  the  variables  of  the  reduced-order 
slow  model  are  x  and  the  quasi-steady-state  value 
zs  of  z.  Here,  zs  is  the  equilibrium  point  of 
(7)  when  x„  is  replaced  with  x.  Hence,  the 
quasi -stea',v-?tat''  value  of  z  is  a  continuous 
implicit  function  of  x.  (Continuity  is  shown 
below.)  The  value  of  zs  can  be  determined  by 
using  the  Katzenelson  algorithm  (see  Section  2.1) 
with  x  substituted  for  x4.  The  algorithm  is 
initialized  with  zsl  equal  to  the  previous  value 
of  zs.  Due  to  continuity,  a  small  change  in  x 
results  In  a  small  change  in  z£.  Hence,  in  time- 
integrating  the  system,  generally  only  steps  0)- 
3)  are  used  to  find  a  new  zs  at  each  time-step. 
Continuity  of  zs  as  a  function  of  x  is  shown  in 
the  proof  of  the  following  theorem. 

Theorem  4;  Let  f:Rr-*Rr  be  a  continuous 
piecewise-linear  mapping  defined  in  the  it*1 
region  by 

f(z)  “  A211  x  +  A22i  z  +  Wji  (18) 

If  f  is  a  homeooorphism  then  the  equilibrium 
point  zfi  of  (18)  is  given  by  a  continuous 
function  of  x. 


'(Aj3i)'1(A21i  x2  +  Wj1)  + 

<AJ11+1>'1<A2i1+1  xx  +  v3i+l)  -  o  (21) 

Adding  equation  (21)  to  equation  (20),  subtract¬ 
ing  the  .»sult  from  (19)  end  taking  the  norm  of 
both  sides  yield*: 

lzs>1-zs>2l  -  i<a221+1)‘1a211+1<*2-*.'i  £ 

l(AI2i+1)'1AJli+l»IXj-x1l 

Re-x-e  -  satisfies  *  LipshiLz  condition  in  the 
open  halfspace  in  region  Si+j.  Therefore,  zs  is 
continuous  for  x  such  that  (x,zs)  lies  on  a 
boundary  hyperplane.  Thus,  zs  is  a  continuous 
function  of  x.  • 

The  reduced-order  slow  model  of  system  (1)- 
(2)  for  t  outside  of  the  boundary  layer,  i.e. 
t>t6+d,  is  given  as  follows: 

xs  •  Aj^Xg  +  Aij^Zg  +  Vj1,  xs(t,)«x0  (22) 

where  zs  is  an  implicit  function  of  x  and  is 
found  using  the  Katzenelson  algorithm. 

The  error  in  the  approximation  is  uje 
entirely  to  the  fact  that  z-zs-M)(u).  This  error 
is  analogous  to  the  error  of  approximating  x  by 
x0  in  the  boundary  layer  solution.  Therefore, 
tlie  effect  of  the  error  can  be  analyzed  similarly 
as  in  Theorems  2  and  3  showing  that  the  errors  in 
the  solution  are  of  order  0(d) • 


Proof :  Since  f  is  a  homeomorphism,  a  unique 

solution  for  zs  exists  for  any  x.  Let  xv  be 
given  with  resulting  zs  given  by  zS(I. 

Let  Si  denote  the  region  of  (xx,zs  2)  in  . 
Suppose  (Xj,zSfl)  lies  in  the  interior  of  region 
Si-  Then  zSlj  can  be  written  as 

*s,i  "  -<**:*>  xt  +  w2*)  (19) 

It  is  clear  that  zs  is  a  continuous  function  of  x 
at  Xj  supposing  that  there  exists  a  6>0  such  that 
(x,zs)  lies  in  region  Si  for  all  x  such  that 

IXi-xICd.  Defining  M  -  K*  -  ^(Aj,1)'^^1  and 
dj'  *  d^  +  Kz(Ajj^)"^w2^  (for  j«i-l,i),  a  6  is 
given  by 


«  -  min  ^IM+(di’-Mxl)l  ,  IM+Otaj-d^1  )lj 

where  M+  *  M^(MM^)  Therefore,  zs  is  a  contin¬ 
uous  function  of  x  for  all  x  such  that  (x,zs) 
lies  in  the  interior  of  a  region. 

Suppose  xt  is  given  ao  that  (xj.z,^)  lies  on  a 
boundary,  say  d^-KgXj+KgZg  2.  Choose  x2  close  to 
xx  resulting  in  zg-zs>2.  ’  If  (x2,*,>2)  lies  in 
region  then  the  above  analysis  is  applied  and 
zs  is  considered  to  be  continuous  from  the  closed 
halfspace  in  region  Sj.  If  (x2,zS(2)  lies  In 
region  ,  then 

*.,j  *  -<A221+I)'1<A2l1+1  x2  +  w21+1)  (20) 
A  consequence  of  the  continuity  of  f  is  that 


4.  EXAMPLE 


The  techniques  previously  described  for 
separating  a  piecewise-linear  singularly  per¬ 
turbed  system  are  demonstrated  on  the  example 
below.  The  model  represents  a  linear  system  with 
a  saturation  nonlinearity  in  the  feedback  loop. 
Such  types  of  models  exist  in  both  flight 
controls  and  in  electrical  circuits.  The  system 
is  given  by 


M* 


■  A12x  +  A12z  -  B2u 

■  A2lx  +  A22z  -  B2u 

-i  ,  if  V14***  <  -l 

KgX+KgZ,  if  jKgX+tgzISl 
1  ,  if  KgX+KjZ  >  1 


(23) 

(24) 


where  u"0.1.  The  parameter  matrices  are  given  as 
follows: 


K*  -  [1  0.861}  Eg  -  [1.220  0.310] 

Th#  initial  conditions  are  given  es  x(0)  ■  t(0)  • 

[2.  3.]'. 


The  substitution  of  u  into  (23)-(24)  yields  a 
piecewise-linear  model,  with  three  regions: 
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S1-{(x,t):Kxx+Ksz<-l),  Sj»{(x,z): |K,x+Kzz|il}  and 
Sj-{(x,z):Kxx+JC8z>1)  .  The  initial  condition  is 
in  Sj> 

The  reduced-order  node  Is  given  in  the  fora  of 
equations  (7)  and  (22)  are  used  in  finding  the 
time  response.  Comparisons  between  these  results 
and  those  obtained  by  time-integrating  the  full 
order  model  are  shown  in  figure  1  through  Figure 
4.  Note  that  the  approximation  matches  the 
actual  response  very  closely,  i.e.  within  an 
error  of  order  0(u).  The  computation  time  for 
the  approximation  was  roughly  one-third  of  that 
for  the  actual  system.  Furthermore,  as  the  value 
of  u  decreases,  the  approximation  becomes  more 
accurate  and  the  relative  computation  time 
decreases  due  to  the  numerical  stiffness  in  the 
actual  system. 

5.  SUMMARY 

A  singular  perturbation  technique  is  devel¬ 
oped  in  this  paper  which  allows  for  a  decoupling 
of  a  continuous  piecewise- linear  system  into  slow 
and  fast  subsysteais.  Under  the  assumption  of 
asymptotic  stability,  the  fast  variable  is  found 
to  decay  in  the  boundary  layer  to  its  quasi- 
steady-state  solution.  This  quast-steady-state 
solution  is  given  by  a  continuous  implicit 
function  of  the  slow  variable.  The  solution  is 
found  using  the  finite  step  algorithm  given  *- 
the  paper.  Sufficient  conditions  for  the 
approximation  to  be  accurate  to  an  order  of  0(u) 
are  given.  The  technique  developed  is  success¬ 
fully  illustrated  via  a  numerical  example. 

REFERENCES 

[1]  P.V.  Kokotovic.  R.B.  O'Malley  and  P. 
Sannuti,  "Singular  Perturbations  and  Order 
Reduction  in  Control  Theory — An  Overview," 
Automatics,  vol.  12,  pp.  123-132,  March 
1976. 

[2]  J.J.  Levin,  "The  Asymptotic  Behavior  of  the 
Stable  Initial  Manifolds  of  a  System  of  Non¬ 
linear  Differential  Equations,"  Tran.  Am. 
Math.  Soc..  vol.  85,  pp.  357-368,  1957. 

[3]  J.  Katzenelson,  "An  Algorithm  for  Solving 
Nonlinear  Resistive  Networks,"  Bell  Syst. 
Tech.  J..  vol.  44,  pp.  1605-1620,  1965. 

[4]  T.  Fujisawa  and  E.S.  Kuh,  "Piecewise- linear 
Theory  of  Nonlinear  Networks,"  SIAM  J. 
Appl.  Math.,  vol.  22,  pp.  307-328,  March 
1972. 


[7]  T.  Ohtsuki,  T.  Fujisawa  and  S.  Kumagai, 
"Existence  Theorem  and  a  Solution  Algorithm 
for  Piecewise-Linear  Resistor  Networks," 
SIAM  J.  Mathematical  Analysis,  vol.  8,  pp. 
69-99,  Feb.  1977. 

(8)  S.M.  Rang  and  L.O.  Chua,  "A  Global  Represen¬ 
tation  of  Multidimensional  Piecewise-Linear 
Functions  with  Linear  Partitions,"  IEEE 
Trans.  Circuits  and  Systems,  vol.  CAS -25, 
pp.  938-940,  Nov.  1978. 

[Z]  W.C.  Rhzinboidt  and  J.S.  Vandwrg.-af t,  "On 
Piecewise  Affine  Mappings  in  Rn,"  SIAM  J . 
Appl .  Math. .  vol.  29,  pp.  680-689,  Dec. 
1975. 

[10)  V.C.  Prasad  and  P.B.L.  Gaur,  "Homeomorphism 
of  Piecewise-Linear  Resistive  Networks," 
Proc.  IEEE,  vol.  71,  pp.  175-177,  Jan.  1983. 

[11 )  M.  Kojima  and  R.  Saigal,  "On  the  Relation¬ 

ship  Between  Conditions  that  Insure  a  PL 
Mapping  is  a  Homt-— or^hism,"  Mathematics  of 
Operations  Research,  vol.  5,  pp.  101- 

109,  Feb.  1980. 

[12]  S-M  Lee  and  K-S  Chao,  "Multiple  Solutions  of 
Piecewise-Linear  Resistive  Networks,"  IEEE 
Trans.  Circuits  and  Systems,  vol.  CAS-30, 
pp.  84-89,  Feb.  1984. 

[13]  Y.  D.  Landau,  Adaptive  Control  -  The  Model 
Reference  Approach,  Marcel  Dekker,  1979. 


[5]  L.O.  Chua,  "Efficient  Computer  Algorithms 
for  Piecewise-Linear  Analysis  of  Resistive 
Networks,"  IEEE  Trans.  Circuits  and  Systems, 
vol.  18,  pp.  73-85,  Jan.  1971. 

[6]  S.N.  Stevens  and  P-M  Lin,  "Analysis  of 
Piecewise-Linear  Resistive  Networks  Using 
Complimentary  Pivot  Theory,"  IEEE  Trans. 
Circuits  and  Systems,  vol.  28,  pp.  429-441, 
May  1981. 


1726 


Figure  1:  Response  of  x2  to  initial  condition  for 
actual  system  (solid  line)  and  approximated 
system. 


Figure  2:  Response  of  x2  to  initial  condition  for 
actual  system  (solid  line)  and  approximated 
system. 
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Figure  3:  Response  of  z2  to  initial  condition  for 
actual  system  (solid  line)  and  approximated 
system. 
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Figure  4:  Response  of  z2  to  initial  condition  for 
actual  system  (solid  line)  and  approximated 
system. 
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Technical  Notes  and  Correspondence 


Singular  Perturbation  in  Piecewise-Linear  Systems 

B.  S.  HECK  AND  A.  H.  HADDAD 

a ostract — This  acts  analyzes  p'ecewUe-llnear  systems  which  art  singu¬ 
larly  perturbed.  A  technique  Is  developed  that  allows  decoupling  of  snch 
systems  into  fast  and  slow  subsystems  for  analysis  and  design.  The  results 
of  a  numerical  example  are  Included  to  demonstrate  this  technique. 

I.  Introduction 

Piecewise-lincar  systems  which  are  singularly  perturbed  are  found  in 
many  applications  including  electrical  circuits  and  flight  control:.  The 
piecewise  linearity  may  be  due  to  nonlinear  elements  such  as  saturation  or 
may  result  from  a  linearization  about  various  operating  points  of  a 
nonlinear  plant.  These  types  of  systems  are  numerically  very  stiff  and, 
hence,  are  difficult  to  analyze.  This  problem  may  be  alleviated  by  using 
singular  perturbation  theory  to  separate  the  system  into  reduced-order 
models,  one  containing  the  slow  dynamics  and  one  containing  the  fast 
dynamics.  The  use  of  standard  singular  perturbation  techniques,  however, 
requires  that  the  system  dynamical  equations  be  smooth  [1],  [2]  ruling  out 
their  use  on  piecewise-lincar  systems.  This  note  extends  the  general 
method  of  singular  perturbation  for  application  to  continuous  piecewise- 
lincar  systems. 

A.  Problem  Formulation 

The  syottm  considered  in  this  note  may  be  represented  in  the  following 
form: 


*-/>(x,  z)  x(t0)~xo 

(!) 

nt-Mx.z)  z(t0)-z o 

(2) 

where  /  and  /j  are  continuous  piecewise-linear  functions,  p  >  0  is  a 
small  parameter,  and  x  €  R”  and  z  G  R'.  The  functions  are  affine  in 
specific  regions  of  the  state  space  (Rp*r)  where  a  region  is  typically 
defined  as  an  intersection  of  halfspaces.  For  example,  (1)  and  (2)  are 
represented  in  the  fth  region  by  the  following  linear  system: 

*»A\,x+A'uz  +  w\  (3) 

l*mA,tlx+A,az+w,v  (4) 

For  the  purposes  of  this  note,  the  fth  region  is  defined  by  the  set  S(  *  {(*, 
z):d,. i  <  K,x  +  K,z  S  d,}  where  Kx  and  Kt  are  row  vectors  and  cf/_, 
<  d,  are  scalars.  By  this  definition,  the  type  of  regions  allowed  are 
parallel  in  that  the  boundaries  do  not  intersect.  An  example  of  a  physical 
system  which  has  this  description  is  one  in  which  the  piecewise-linear 
element  is  in  a  scalar  feedback  loop.  The  reason  for  the  restriction  will  be 
discussed  in  Section  n. 

The  following  is  an  outline  of  the  note.  Section  D  discusses  the 
boundary  layer  solution  and  develops  a  reduced-order  model  to  approxi¬ 
mate  this  solution.  The  outer  solution  along  with  a  corresponding 
reduced-order  model  is  discussed  in  Section  m.  A  numerical  example  is 
presented  in  Section  IV  to  demonstrate  the  techniques  developed  in  this 
note.  Section  V  concludes  the  note. 
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D.  Boundary  Layer  Solution 

The  fast  dynamics  of  the  system  are  most  prominent  during  the 
boundary  layer  and  can  be  decoupled  from  the  slow  dynamics  by 
introducing  an  expanded  time  scale  r  =  (f  -  fo)/M-  Examination  of  (1) 
shows  that  x stays  relatively  constant  with  respect  to  r  assuming  that  A'tl, 
A  '2,  and  w\  are  bounded  in  all  regions  S,  [1],  Equation  (2)  may  be 
rewritten  as  follows: 

^=/(f)  (5) 

where  Hr)  «  z(jit  +  f0)  and  /(£)  -  f2(x0,  £).  The  function  /  is  a 
continuous  piecewise-linear  mapping  from  Rr  into  Rr.  The  state  space  in 
R'  is  partitioned  into  regions  where  the  function  is  affine,  e.g.,  the  fth 
region  is  defined  as  the  set  R,  -  {z.d,. ,  <  +  Ktz  S  d,}.  A 

degenerate  case  where  Kt  =  0  results  in  the  existence  of  only  one  region 
in  R'so  that/is  affine.  The  initial  quasi-steady-state  value,  z,(f0),  of  z(f) 
is  a  stable  equilibrium  point  of  (5).  Note  that  the  equilibrium  point  of  the 
degenerate  case  is  easily  found. 

The  equilibrium  point(s)  of  (5)  for  the  nondegenerate  case  can  be  found 
using  solution  techniques  developed  for  piecewise-linear  resistive  net¬ 
works.  Many  papers  have  been  written  on  finding  the  solution  x  of  the 
equation/(jr)  -  y  where/is  a  continuous  piecewise-linear  function,  e  g., 
[3]-[9].  Fujisawa  and  Kuh  show  in  [4]  that  a  continuous  piecewise-linear 
ftmction  satisfies  a  Lipshitz  condition.  The  following  theorem  from  [4] 
gives  sufficient  conditions  for  the  existence  and  uniqueness  of  the 
solution. 

Theorem  I:  Let /be  a  continuous  piecewise-linear  mapping  ot  R'  into 
itself  and  let  J‘k  denote  the  matrix  composed  of  the  first  k  rows  and 
columns  of  the  Jacobian  matrix  J‘  in  region  R,.  The  mapping  is  a 
homeomorphism  of  R’  onto  itself  if,  for  each  k  =  1,  2,  •  •  • ,  r,  the 
determinants  of  the  k  x  k  matrices 

J I  J2  ...  /« 

do  not  vanish  and  have  the  same  sign. 

This  previous  work  is  used  in  finding  the  equilibrium  point(s)  of  system 
(5)  by  solving  /(£)  =  0.  In  this  application,  J1  ~  A 'u  and  each  A  ^  is 
assumed  to  be  Hurwitz  for  stability  purposes.  The  conditions  of  Theorem 
1  may  be  stringent  and  various  other  sufficient  conditions  for  the  existence 
and  uniqueness  of  the  solution  are  given  in  [9]-[l  1].  Also,  [12]  discusses 
nonunique  solutions. 

A.  Algorithm  to  Solve  for  Equilibrium  Point 

The  Katzenelson  algorithm  is  widely  used  in  solving  for  x  in  the 
equation 

fW~y  (6) 

where/:/!'  —  Rr  is  continuous  and  piecewise  linear.  The  basic  outline  of 
this  algorithm  used  in  solving  /(£)  =  0  is  given  below.  More  details  of 
the  general  method  are  given  in  [4).  Let  W1  =  AltJ x0  +  iW  vy,  and 
denote  the  iteration  number  on  z,  and  X  by  superscripts. 

0)  Initialize  by  letting  i  =  1  and  z'  =  z«. 

1)  Solve  z  =  -(AJn)~'tVf  where  region  R,  contains  z1,. 

2)  If  z  lies  in  region  Rh  then  z,  «  z  and  stop. 

3)  Otherwise,  let  Rt  be  the  region  containing  z; 

If  k  >  y,  then  d  -  dj  and  then  let ;«;+  1 

If  k  <  y,  then  d  *  dj.  \  and  then  let  j  =  j  -  1 . 

4)  Solve  X'  «  (K,z,  +  Ap r0  -rf)/A,(z'  -  z). 

5)  Solve  z'+ 1  ■  z'  -  X'(z'  -  z). 

6)  Let  i  »  /  4-  I  and  go  to  1). 
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It  is  shown  in  [4]  that  if  the  piecewise-linear  function  is  a  homeomor- 
phism  (e.g,,  it  satisfies  the  conditions  of  Theorem  1).  then  the  algorithm 
will  converge  in  a  finite  number  of  steps. 

B.  Boundary  Layer  Approximation 

A  fast  model  approximating  the  dynamics  occurring  in  the  boundary 
layer  can  be  found  once  the  equilibrium  point  of  system  (S)  is  known.  The 
boundary  layer  solution  is  then  given  as  fc(r)  =  £(r)  -  z,(/0).  In  this 
application,  z,  must  be  found  implicitly  because  fi  is  not  smooth. 
Therefore,  the  fast  model  approximating  the  boundary  layer  solution  is 
given  in  terms  of  i.  In  the  Ah  region  the  fast  model  is  given  by 

di 

—  =  A'2|jr0  +  -4,Ji£+H''2  £(0)  =  Zo 

f/(r)  =  z(r)-ZJ(t„)  (7) 

where  the  Ah  region  is  defined  by  the  set  R,  =  {t-d,.  t  <  Kxx0  +  Kti  5 

d,). 

For  the  purposes  of  this  note,  it  is  assumed  that  there  exists  exactly  one 
equilibrium  point  which  is  asymptotically  stable.  Multiple  stable  equilib¬ 
rium  points  may  be  handled  by  partitioning  the  state  space  into  domains  of 
attraction  for  the  various  equilibrium  points  and  the  analysis  in  this  note 
holds  for  each  domain  of  attraction 

Asymptotic  stability  is  assumed  in  this  system  although  there  is  no 
known  general  method  for  determining  asymptotic  stability  of  piecewise- 
linear  systems.  Depending  on  the  specific  system  under  consideration,  a 
Lyapunov  function  may  be  found.  Another  possibility  is  to  use  standard 
SISO  frequency  domain  techniques  or  hyperstability.  For  using  hypersta¬ 
bility  notions,  system  (5)  may  be  rewritten  as 

di 

jr~At  +  Bu  (8) 

where  A  is  chosen  to  be  stable,  B  is  the  identity  /,  and  u  is  defined  in  the 
Ah  region  to  be  u  =  AA't  +  A\2x0  +  w'2  where  AA '  =  A‘u  -  A.  If  the 
nonlinearity  in  the  feedback  loop  satisfies  the  Popov  integral  inequality, 
then  the  necessary  and  sufficient  condition  for  asymptotic  stability  is  that 
the  transfer  matrix  is  l  -  A)' 1  must  be  strictly  positive  real  (13). 

The  errors  in  this  approximation,  which  are  of  order  Oiji),  are  due  to 
the  substitution  of  jr0  for  x  in  (7)  and  in  the  definition  of  the  regions. 
Substituting  x  =  x0  +  0(jj)  in  (7)  and  in  R,  yields  the  system 

dz 

—  =  A‘}l{x0  +  O{)i))  +  A'uz+w'J  £(0)  =  Zo 

/?,  =  {£:  di.,  +  Oin)<Kxx9  +  K,i  a  </,  +  0(ji)}  (9) 

where  £  represents  die  actual  response.  In  the  interior  of  any  particular 
region,  both  the  approximation  and  the  actual  model  are  linear.  Previous 
results  on  singular  perturbation  theory  in  linear  systems  show  that  if  £(r') 
”  £(r')  +  O(m),  then  £(r')  =  £(r')  +  0(ji)  for  r'  >  r'  as  long  as 
both  t  and  i  stay  within  the  region.  The  problems  that  may  arise  due  to  a 
boundary  crossing  are  eliminated  if  the  class  of  systems  allowed  is 
restricted  to  those  in  which  the  vector  field  intersects  a  boundary 
hyperplane  at  a  large  enough  angle  [i.e. ,  *O0t)l.'  In  these  systems  if 
either  £  or  £  crosses  into  another  region,  the  other  must  also  cross  into  that 
region.  The  resulting  error  in  the  approximation  remains  of  order  Oiji). 
These  conditions  are  summarized  in  the  following  theorems.  Note  that  the 
restriction  placed  on  the  class  of  systems  is  sufficient  and  not  necessary 
for  proving  that  the  approximation  error  is  of  order  Oiji). 

Theorem  2:  Let  the  vector  field  near  a  boundary  at  d,  =  Af«£  +  Kxx0 
+  Oiji)  in  the  space  Rr  be  given  by 

/(£)  =  A  '2l(x0  -1-0(11))  +  A'u£+h-'2.  (10) 

Assume  that  /(£)  does  not  vanish  near  the  boundary.  If  /(£)  does  not 
intersect  the  boundary  with  an  angle  of  order  Oiji),  then  the  difference 
between  the  solutions  of  (7)  and  (9)  is  of  order  0(ji). 

1  A  *  0(m)  is  used  to  mean  JA||/|i—  +  ®a»a~*0- 


Proof:  Assume  £  crosses  the  d,  boundary  at  r  ’  and  £  has  not  crossed 
yet.  Prior  to  crossing  £  =  £  +  Oiji).  The  norma)  vector  of  the  boundary 
hyperplane  is  given  by  n  =  K*/\\Kt\\.  Since  f(z)-n  *  Oiji),  then 


KfA  '21  (jr0  +  O(m))  +  A  ‘uz  +*))*  O(m) 

di) 

It  follows  that 

KfA'^Xt-i  A‘ui+  w‘2)*Oifi)- 

(12) 

Define  S  and  £  by 

II 

1 

a 

(13) 

s=K.i-d;  +Oif) 

(14) 

where  d't  =  d,  -  K,xa.  Assume  i,  i  >  0.  For  £  to  cross  the  boundary  , 
dS/dr  <  0  where  ds/dr  is  given  by  expression  (i  1).  Correspondingly, 
di/dr  <  0  where  dS/dr  is  given  by  expression  (12).  At  the  boundary 
crossing,  £(r')  =  0  so  that  Ktz  -  d /  *  Oiji).  It  follows  that  f(r')  = 
Oiji).  Since  dS/dr  ±  Oiji),  then  AS/ At  *  Oiji).  Hence,  Ar  =  Oiji) 
since  AS  =  £(r')  *  Oiji).  Therefore,  if  £  crosses  a  boundary  into  a  new 
region  at  t',  then  £  must  also  cross  into  the  same  region  at  a  time  r'  such 
that  r'  *  t'  +  Oiji). 

It  remains  to  be  shown  that  the  time  difference  of  Oiji)  in  the  boundary 
crossing  has  Oiji)  effect  on  the  solution.  Let  ,4  =  A‘uondAA  =  A - 
A  where  Rj  is  the  new  region  and  let  r0  <  r'  be  such  that  both  £(r0)  and 
£(to)  lie  in  region  R,.  Then  the  solution  of  (7)  for  r  >  r'  is 

£(r)  =  *(r,  r0)£(r0)+  [  *(r,  a)iAAi  +  A{tx0  +  w')  da 

+  f’  *(r.  o)(A'21Xb+w'2)da  (15) 

J’o 

where  <fr(r,  r')  *  exp  [A(t  -  r')].  Since  the  integrands  are  bounded  in 
both  integrals  and  r*  -  r'  *  Oiji),  (15)  is  rewritten  as 

£(r)  =  4>(r,  r0)£(r0>+  j  4>(t,  a)(AAt  +  A,llx0+ w^})  da 

+  T  *(r.  oKA^Xo+w'Jda  +  Oiu).  (16) 
J'o 

Similarly,  the  solution  to  (9)  is  found  to  match  the  form  of  equation  (16) 
exactly.  Hence,  £(r)  *  £(r)  +  Oiji).  ■ 

Theorem  3:  Let  the  vector  field  near  a  boundary  at  d,  =  K.i  +  Kxx a 
in  the  space  R'  be  given  by 

/(£)»*  v42|jr0  +  /4y£+  w'j.  (17) 

Assume  that  /(£)  does  not  vanish  near  the  boundary.  If  /(£)  does  not 
intersect  the  boundary  with  an  angle  of  order  Oiji),  then  the  difference 
between  the  solutions  of  (7)  and  (9)  is  of  order  Oiji). 

Proof:  The  proof  is  very  similar  to  that  of  Theorem  2.  The  gist  of  the 
proof  is  to  show  that  if  £  crosses  the  boundary  prior  to  a  crossing  of  £,  then 
£  must  cross  within  a  time  of  order  OijT).  The  time  delay  in  crossing 
affects  the  error  in  the  approximation  only  by  order  Oiji). 

Using  the  results  of  Theorems  2  and  3  it  is  seen  that  the  errors  in  the 
approximation  are  of  order  Oiji).  The  restriction  given  in  Section  I  that 
the  regions  of  linearity  be  parallel  is  used  in  the  proof  of  the  theorems  but 
is  not  a  necessary  condition.  The  difficulty  is  showing  that  if  a  solution 
crosses  a  boundary  near  an  intersection  of  boundaries,  then  the 
approximation  will  remain  within  an  error  of  Oiji). 

III.  Outer  Solution 

A  reduced-order  model  for  system  (1)  and  (2)  is  developed  below  with 
approximation  errors  of  order  Oiji)  for  the  time  outside  of  the  boundary 
layer.  Assuming  that  the  fast  subsystem  given  in  (7)  is  asymptotically 
stable  to  its  equilibrium  point,  the  fast  component  of  z  is  negligible  outside 
of  the  boundary  layer.  Therefore,  the  variables  of  the  reduced -order  slow 
model  are  x  and  the  quasi-steady-state  value  z,  of  z.  Here,  z,  is  the 
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equilibrium  point  of  (7)  when  x0  is  replaced  with  x.  Hence,  the  quasi- 
steady-state  value  of  z  is  a  continuous  implicit  function  of  x.  (Continuity  is 
shown  below.)  The  value  of  z,  can  be  determined  by  using  the 
Katzenelson  algorithm  (see  Section  D-A)  with  x  substituted  for  xo.  The 
algorithm  is  initialized  with  z]  equal  to  the  previous  value  of  z,.  Due  to 
continuity,  a  small  change  in  x  results  in  a  small  change  in  z,.  Hence,  in 
time-integrating  the  system,  generally  only  steps  0)-2)  are  used  to  find  a 
new  z,  at  each  time-step.  Continuity  of  Z,  as  a  function  of  x  is  shown  in  the 
proof  of  the  following  theorem. 

Theorem  4:  Let  f.R'  -»  R'  be  a  continuous  piecewise-linear  mapping 
defined  in  the  rth  region  by 

/(zl-A'.e+A'jZ+w',.  08) 

If /is  a  homeomorphism,  then  the  equilibrium  point  z,  of  (18)  is  given  by 
a  continuous  function  of  x. 

Proof:  Since /is  a  homeomorphism,  a  unique  solution  for  z,  exists 
for  any  x.  Let  Xt  be  given  with  resulting  z,  given  by  z,j.  Let  S<  denote  the 
region  of  (Jf|.  Z,,i)  in  RP*'. 

Suppose  (xi,  Z,.i)  lies  in  the  interior  of  region  Sh  Then  Z,.i  can  be 
written  as 

+  (19) 

It  is  clear  that  z,  is  a  continuous  function  of  x  at  x,  supposing  that  there 
exists  a  5  >  0  such  that  (x,  Z,)  lies  in  region  S,  for  all  x  such  that  ||x,  - 
x||  <  i.  Defining  M  *  K,  -  KJA'^-'A^  and  rf/  =  <*,+ 
(for  j  =  i  -  1,  /),  a  5  is  given  by 

&  *  min  ( |  KT  (d /  -  Afx,  )|| ,  AT  (Mx,  -  <//_ ,  )||  ] 

where  M1  =  Therefore.  Z.  is  a  continuous  function  of  X 

for  all  x  such  that  (x,  Z,)  lies  in  the  interior  of  a  region. 

Suppose x,  is  given  so  that  (X| ,  Z,,t)  lies  on  a  boundary ,  say  d,  *  KrX\ 

+  K,Z, .,.  Choose  x2  close  to  x,  resulting  in  z,  »  z,j-  If  (x2,  Z,.j)  lies  in 
region  S,,  then  the  above  analysis  is  applied  and  z,  is  considered  to  be 
continuous  from  the  closed  halfspace  in  region  S(.  If  (x2,  Z,.2)  lies  in 
region  S,  ,  i,  then 

z,.2  =  -(AaV'Mii'xj+w'/').  (20) 

A  consequence  of  the  continuity  of/ is  that 

-  (A  >aY'(A '  ,x,  +  w'2)  +  (A  x')~'(A  'J  X,  -t-  w'* 1 )  =  0.  (21) 

Adding  (21)  to  (20),  subtracting  the  result  from  (193  and  talcing  the  norm 
of  both  sides  yields 

II Z,. ,  —  Z,.2 1|  =  II (-4  1 ) - 1  '27 ’ (Jr2  —  x, )H  S  H(/l'z!')~,'‘,2!'ll  li*!-*'!!- 

Hence,  z,  satisfies  a  Lipshitz  condition  in  the  open  halfspace  in  region 
S/*  |.  Therefore,  Z,  is  continuous  for  x  such  that  (x,  z,)  lies  on  a  boundary 
hyperplane.  Thus,  z,  is  a  continuous  function  of  x.  ■ 

The  reduced-order  slow  model  of  system  (1)  and  (2)  is  given  in  the  rth 
region  of  R ",  jx, <  K,x,  +  Ktz,  S  d,},  as  follows: 

X,-/4'1x1-t-A'„z,+  w\  X,(r0)-x«  (22) 

where  z,  is  an  implicit  function  of  x,  and  is  found  using  the  Katzenelson 
algorithm.  The  actual  variables,  xand  z,  are  approximated  by  x,  and  z,  for 
!  outside  of  the  boundary  layer,  i.e. ,  t  >  to  +  S- 
The  error  in  the  approximation  is  due  entirely  to  the  fact  that  z  =  z, 
0(jn).  This  error  i-  analogous  to  the  error  of  approximating  x  by  Xo  in  the 
boundary  layer  solution.  Therefore,  the  effect  of  the  error  can  be  analyzed 
similarly  as  in  Theorems  2  and  3  showing  that  the  errors  in  the  solution 
are  of  order  C/(jt). 

rv.  Example 

The  techniques  previously  described  for  separating  a  piecewise-linear 
singularly  perturbed  system  are  demonstrated  in  the  following  example. 


Fig.  I.  Response  of  x  to  initial  condition  for  actual  system  (solid  line)  aid 
approximated  system. 

The  model  represents  a  linear  system  with  a  saturation  nonlinearity  in  the 
feedback  loop.  Such  types  of  models  exist  in  both  flight  controls  and  in 
electrical  circuits.  The  system  is  given  by 

x  =  A,,x  +  A,  2z-B,u  (23) 

/iZ  *  A2,X  +  AnZ  -  B2u  (24) 

(-1,  if  K,x  +  Kzz<  -  1 

K,x  +  K,Z,  i(\K,x  +  K,z\  g  1 
1,  if  K,x  +  K:z>  I 

where  n  =  0.1.  The  parameter  matrices  are  given  as  follows: 

o  o]  A“‘  [o.°«  0 

a.-[;  -f  ]  f-[-r 

A,- (1  0.861]  *>(1.220  0.310). 

The  initial  conditions  are  given  is  x(0)  =  z(0)  =  [2.  3.]r. 

This  substitution  of  u  into  (23)  and  (24)  yields  a  piecewise-linear 
model,  with  three  regions:  5,  =  {(x,  z)-Kxx  +  K,z  <  -  1},  52  =  {(x, 
z);|AT,x  +  AT,zl  s  1},  and  S>  -  {(x,  z).fCsx  +  K,z  >  1 } .  The  initial 
condition  is  in  S3. 

The  reduced-order  models  given  in  the  form  of  (7)  and  (22)  are  used  in 
finding  the  time  response.  Comparisons  between  these  results  and  those 
obtained  by  time-integrating  the  foil  order  model  are  shown  in  Figs.  1-4. 
Note  that  the  approximation  matches  the  actual  response  very  closely, 
i.e.,  within  an  error  of  order  0(/i).  The  computation  time  for  the 
approximation  was  roughly  one-third  of  that  for  the  actual  system. 
Furthermore,  as  the  value  of  m  decreases,  the  approximation  becomes 
more  accurate  and  the  relative  computation  time  decreases  due  to  the 
numerical  stiffness  in  the  actual  system. 

V.  Summary 

A  singular  perturbation  technique  is  developed  in  this  note  which 
allows  for  a  decoupling  of  a  continuous  piecewise-linear  system  into  slow 
and  fast  subsystems.  Under  the  assumption  of  asymptotic  stability,  the  fast 
variable  is  found  to  decay  in  the  boundary  layer  to  its  quasi-steady-state 
solution.  This  quasi-steady -sute  solution  is  given  by  a  continuous  implicit 
function  of  the  slow  variable.  The  solution  is  found  using  the  finite  step 
algorithm  given  in  the  note.  Sufficient  conditions  for  the  approximation  to 
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Fig  2.  Response  of  x,  to  initial  condition  for  actual  system  (solid  line)  and 
approximated  system. 


Fig.  3.  Response  of  z,  to  initial  condition  for  actual  system  (solid  line)  and 
approximated  system. 


be  accurate  to  an  order  of  0(ji )  are  given.  The  technique  developed  is 
successfully  illustrated  via  a  numerical  example. 
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ABSTRACT 


2.  BACKGROUND  MATERIAL 


This  paper  continues  the  analysis  of  singularly 
perturbed  piecewise- linear  systems.  It  provides  a 
less  restrictive  sufficient  condition  for  the 
validity  of  the  singular  perturbation  analysis  of 
such  systems.  The  paper  also  considers  the 
additional  time-scale  separation  analysis  required 
by  the  existence  of  sliding  modes.  Finally,  the 
effect  of  random  inputs  on  such  systems  is  exam¬ 
ined. 


1.  INTRODUCTION 

This  paper  addresses  problems  in  piecewise- 
linear  systems  which  are  singularly  perturbed. 
Such  systems  are  found  in  many  applications 
Including  electrical  circuits  and  flight  controls. 
The  piecewise-linearity  may  be  due  to  nonlinear 
elements  such  as  saturation  or  may  result  from  a 
linearization  about  various  operating  points  of  a 
nonlinear  plant.  Singular  perturbation  theory  is 
used  to  separate  the  system  into  reduced-order 
models,  one  containing  the  slow  dynamics  and  one 
containing  the  fast  dynamics.  Standard  singular 
perturbation  techniques,  however,  are  limited  to 
systems  which  are  smooth  [1,2].  Recently,  singular 
perturbation  theory  has  been  extended  to  certain 
types  of  piecawise-linear  systems,  i.e.,  those  with 
continuous  dynamics  [3]  and  those  with  a  scalar 
quantized  control  [A],  This  paper  extends  these 
earlier  results.  These  earlier  papers  [3, A] 
provided  reduced-order  models  for  the  slow  and  fast 
dynamics  and  theorems  showing  that  these  models 
approximate  the  actual  system  within  an  appropri¬ 
ately  small  error.  The  theorems  were  based  on 
geometric  ideas  and  were  restrictive  in  their 
applicability.  The  results  are  extended  to  include 
the  occurrence  of  a  sliding  mode  in  the  quantized 
control  case.  A  new  nongeometric  criterion  is 
Introduced  for  using  singular  perturbation  in  the 
continuous  piecewise-linear  case.  This  criterion 
is  easy  to  use  and  the  proof  is  straightforward. 
Finally,  the  effect  of  random  inputs  is  also 
considered. 

The  remainder  of  the  paper  is  outlined  as 
follows.  Section  2  contains  background  information 
summarizing  the  results  of  [3]  and  [A]  and  provld- 
ing  physical  insight  into  the  restrictions  required 
for  using  these  results,  theorems.  Section  3 
discusses  the  effect  of  a  sliding  mode  occurring  in 
the  quantized  control  case.  Section  A  contains  a 
new  criterion  for  applying  singular  perturbation 
theory.  The  random  input  analysis  is  contained  in 
Section  5. 


The  two  types  of  system  analyzed  in  [3, A]  are 
those  which  are  continuous  and  those  which  are  the 
result  of  a  quantized  control.  Both  types  of 
systems  may  be  expressed  in  the  following  form. 

x  ■  fx(x,z),  x(t0)~x0  (1) 

uz  -  f,(x,z),  z(t0)-z0  (2) 

where  u  is  a  small  positive  parameter  and  fx  and  f2 
are  piecewise-linear  functions  mapping  from  Rn‘fm  to 
Rn  and  R®,  respectively.  The  functions  are  affine 
in  specific  regions  of  the  state  space  (Rn+m)  where 
a  region  is  typically  defined  as  an  intersection  of 
halfspaces.  The  i1™  ’■~£ion  is  defined  by  the  set 

Sj-Hx.z):  d^.j<K1x+KjZSdi)  where  Kx  and  K:  are  row 
vectors. 

The  systems  with  continuous  dynamics  as 
analyzed  in  [3]  are  those  where  the  piecewise- 
linear  functions,  fx  and  f2,  are  continuous.  Such 
systems  may  be  represented  in  the  1th  region  by  the 
following  "linear"  system: 

x  -  An'x  +  A1Jiz  +  wxi  (3) 

uz  *  A21ix  +  A22iz  +  w]i  (A) 

The  fast  model  yielding  the  boundary  layer  solution 
is  found  by  introducing  an  expanded  time-scale 

and  (2)  are  expressed  in 


x(0)»  x„  (5) 

z(0)«  z0  (6) 

x(x)»x(ux+t0 ) .  The 

variable  x  is  found  to  remain  constant  with  respect 
to  x,  so  x(x)*x0.  Equation  (6)  is  then  approxi¬ 
mated  as  follows: 

*  f(z),  z(0)«z„  (7) 

where  f(z)«f2(x, ,z).  The  function  f  is  a  contin¬ 
uous  piecewise-linear  mapping  from  R°  into  Rm.  The 
state  space  in  RB  is  partitioned  into  regions  where 
the  function  is  affine;  e.g.,  the  1th  region  is 
defined  by  the  set  R<*{z:  di.^<K1x8+K2zSdi} .  The 
equilibrium  point  for  (7)  (i.e.,  the  initial  quasi- 
steady-state  solution,  z-(t,))  is  found  using  the 
Katzenelson  algorithm  [3J.  The  approximation  for 
the  boundary  layer  solution,  given  by  z-zs(t,),  is 
found  implicitly  from  the  fast  model  defined  in  the 
1th  region  of  RM  as  follows: 


X"(t-t„)/u.  Equations  (1) 
the  x-time  as 

dx  “  M*.*). 
where  z(x)*z(ux+t0 )  and 
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tfnuflr***': 


dt  "  A,!1*,  ♦  A,, it  +  Wji  (8) 

The  reduced-order  flow  model  of  (3)-(4)  for  t 
outslda  of  the  boundary  layer  is  given  in  the  ic^ 
region  of  Rn,  (xs:  di.jO^Xj+KgZjSdi) ,  as  follows: 

*s  *  Anlxs  +  AX2i*#  ♦  wji,  x#(t„)-x0  (9) 

where  zs  is  a  continuous  implicit  function  of  x  and 
is  found  using  the  iCatzenelson  algorithm. 

Reduced-order  models  for  systems  with  scalar 
quantized  control  input  are  developed  in  [4].  It 
was  found  that,  without  loss  of  generality,  only 
those  systems  need  to  be  considered  which  satisfy 

(I)  and  (2)  with 

f1(x,Z>  -  AgX  +  BgU  (10) 

fj(x,z)  -  A2z  +  BjU  (11) 

u  -  Qf-KiX-Kjt) 

The  quantizer  function  is  defined  as  Q( -Kxx-K2z)"C^ 
for  (x,z)  in  the  ith  region.  It  is  required  that 
c^<ci+i,  d0"-«,  dn+^«+«  and  that  A2  be  invertible. 

A  fast  model  for  this  system  approximating  the 
actual  solution  in  the  boundary  layer  was  developed 
in  a  manner  similar  that  for  thi  cc.'tlnuous 

dynamics  case  described  above.  Using  notation 
introduced  previously,  the  fast  model  is  given 
below. 

—  ■  Ajt  +  B2u;  S(0)»Zg  (12) 

u  ”  Q( -KjXg-Kjz) 

The  boundary  layer  solution  is  given  as  z(t)-zs, 
where  z?  is  the  equilibrium  point  of  (12).  Define 
z<“-A2**B2c^.  Then  zs  can  be  written  as  a  mapping 
of  KiX,,  zs“f(KlXg),  where  f(C)  is  as  follovs. 

i)  f(5)  *  -Aj_1B2u,  if  K2“0  where  u«C(-5), 

ii)  f(C)  -  *t,  if  K2»0 

and  diS-KJzi-C<di+^  for  some  i,  (13) 

iii)  f(£)  -  (t+di+1)m,  if  K2»0  and 

■di+l *,:i*i+l  (  5  *  *di+i**2*i  for  some  i 
where  m  -  (zi+1-zi)/(K2(zi-zi+1)] . 

If  there  is  no  feedback  from  the  fast  variable, 
then  case  1)  holds.  Case  ii)  corresponds  to  an 
equilibrium  point  z^  lying  inside  its  own  region, 
i.e.  (x,,zi)cS1.  Case  iii)  corresponds  to  an 
equilibrium  point  lying  on  one  of  the  boundaries 
between  regions.  Tor  case  iii),  the  resulting 
control  switches  rapidly  between  two  values  to 
maintain  the  equilibrium.  It  was  shown  in  [4]  that 
f  Is  single-valued  if  K2A2*lB2<0;  therefore,  this 
assumption  will  be  made  in  this  paper.  Note  that  f 
Is  a  continuous  function. 

The  quantized  system  given  by  (l)-(2)  and  (10)- 

(II)  is  approximated  outside  of  the  boundary  Layer 
by  the  solution  to  the  following  alow  model: 

*s  *  A**,  +  B,ui  x,(t,)-x,  (14) 

u  -  Q( **i*a-l2z,)s  t,  •  f(K1xs) 


By  the  definition  of  f,  it  is  seen  that  case  iii) 
applies  for  (zs,zs)  lying  on  a  boundary  hyperplane 
and  case  ii)  applies  otherwise. 

The  approximation  errors  for  the  reduced-order 
models  (8),  (9),  (12)  and  (14)  are  shown  under 
certain  restrictions  to  be  of  order  0(u).  In  the 
fast  model  of  (8),  the  error  is  due  to  the  substi¬ 
tution  of  x0  for  x.  The  actual  solution,  £,  is 
given  by  the  system  described  in  region  Ri  as 

■  Alli(x,+0(u))  +  A22i£  +  wj1  (15) 

Rj  “  (z:  di.j+0(y)  <  KjXg+KgZ  i  di+0(u)). 

The  following  theorems  from  [3]  prove  that  the 
approximation  errori  are  of  order  0(m). 

Theorem  1 :  Let  the  vector  field  near  a 
boundary  at  d^KjZ+KjXg+CHu)  in  the  space  R®  be 
given  by 

f(z)  ■  A21i(xg+C'v>)  +  A22*z  +  w 2 1 

Assume  the  f(z)  does  not  vanish  near  the  boundary. 
If  f(z)  intersects  the  boundary  with  an  angle  of 
order  0(u#),  then  the  difference  between  the 
solutions  of  (8)  and  (15)  is  of  order  0(u). 

Theorem  2:  Let  the  vector  field  near  a 
boundary  at  di*K2z+K1xe  in  the  space  Rn  be  given  by 

f(z)  -  Ajj^xg  +  Aj^z  +  w,1 

Assume  that  f(z)  does  not  vanish  near  the  boundary. 
If  f(z)  intersects  the  boundary  with  an  angle  of 
order  0(u°),  then  the  difference  between  the 
solutions  of  (8)  and  (15)  is  of  order  0(u). 

The  gist  of  the  proofs  of  these  theorems  is 
that  if  the  solutions  of  (8)  and  (15)  both  exist  a 
particular  region  of  the  state  space,  then  the 
error  between  them  is  of  order  0(m)  due  to  the 
linearity.  The  problems  that  may  arise  at  a 
boundary  crossing  are  eliminated  due  to  the 
restriction  on  the  vecto-  field.  Thus,  if  one 
solution  crosses  into  another  region,  the  other 
solution  must  also  cross  into  that  region  within  a 
time  delay  of  0(m)-  The  resulting  error  remains  of 
order  0(u). 

Theorems  1  and  2  may  be  directly  applied  to  the 
quantized  control  system  to  show  that  the  error  in 
the  fast  model  (12)  is  also  of  order  0(u)  since 
continuity  is  not  required  in  the  proofs.  The 
approximation  errors  in  both  slow  models  (9)  and 
(14)  are  due  to  the  fact  that  z*zs+0(u).  This 
error  is  analogous  to  the  error  introduced  into  the 
boundary  layer  solutions;  therefore.  Theorems  1  and 
2  are  applicable.  The  main  consideration  for  using 
the  slow  models  is  that  the  boundary  layer  solution 
must  be  stable  so  it  is  negligible  outside  of  the 
boundary  layer.  A  further  consideration  is  that  z9 
must  vary  slowly  with  x,  so  that  the  fast  dynamics 
are  not  excited.  This  was  shown  for  both  models 
separately  in  (3]  and  (4). 

The  restriction  in  the  hypothesis  of  Theorems  1 
and  2  concerning  the  angle  of  intersection  is  hard 
to  satisfy  in  many  cases.  Tor  example,  the  angle 
of  Intersection  described  in  Theorem  2  is  found 
from  the  inner  product  of  f(z)  and  the  normal  to 
the  surface,  nml2^/IK2l.  Hence,  it  is  required 
that 
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♦  Ajj1*  +  "i1)  ■  O(m')  (16) 

near  tha  boundaries.  Not*  that  tha*  boundary 
hyperplane*  ara  parallel ;  all  ara  given  as  trans- 
latas  of  tha  null  space  of  K2  in  Ra. 

We  now  define  a  new  variable  y  by 

y  *  *  +  (Aj,1)  l(A2l*x0  +  Wj1). 

Then  tha  condition  in  (16)  becomes 

KjA22*y  •  0(u°). 

along  the  boundaries  defined  by 

K,y  -  dt's  dt'  -  di-H:,(A,,1)‘l(A111x0-Hf,1)  +  E2xs 

If  t2*0,  and  since  A22*  has  full  rank,  the  condi¬ 
tion  will  fail  only  in  the  0(u)  neighborhood  about 
the  Intersection  of  the  null  space  of  K2A22*-  and 
tha  boundary.  Note  that  this  intersection  is  an 
m-2  dimensional  manifold  on  which  the  vector  field 
is  exactly  tangent  to  the  boundary.  (If  ,A221  does 
not  rotate  the  domain  space,  e.g.  if  A22*«-I,  then 
there  is  no  intersection.) 

Hence,  the  use  of  Theorems  1  and  2  in  showing 
that  a  particular  system  approximation  is  valid, 
almost  requires  knowing  tha  solution  beforehand. 
Unless  A22  has  special  properties  mentioned  above, 
there  exists  at  least  one  point  of  tangency  on 
every  boundary.  If  the  vector  field  is  continuous, 
then  there  exists  only  one  point  of  tangency.  If 
the  vector  field  is  discontinuous  at  the  boundary, 
then  there  are  two  distinct  points  of  tangency,  one 
for  each  side  of  the  boundary.  The  points  in  the 
space  where  the  condition  of  the  theorems  fails 
form  a  set  of  measure  0(u)  in  the  space.  Whether 
the  solution  of  the  system  is  in  this  set  depends 
on  the  Initial  conditions.  However,  note  that  this 
condition  is  sufficient  but  not  necessary  for  the 
approximation  error  of  the  reduced-order  models  to 
be  of  order  0(p). 

3.  SLIDING  MODE  EQUATIONS 

The  previous  results  on  singular  perturbation 
of  systems  with  quantized  control  do  not  account 
for  the  possibility  of  a  sliding  mode  to  exist  on  a 
switching  boundary.  Sliding  modes  may  occur  in  any 
system  in  which  the  dynamical  equations  are 
discontinuous.  Much  research  can  be  found  on  this 
topic  under  the  more  general  title  of  variable 
structure  systems,  see  for  example  IS].  The  term 
"sliding  mode"  characterizes  the  behavior  of  a 
system  when  the  vector  fields  on  both  sides  of  a 
switching  boundary  point  towards  the  boundary.  A 
representative  point  is  directed  towards  the 
boundary  from  both  sides  and,  therefore,  is  forced 
to  move  (or  slide)  along  the  boundary.  Because  the 
systaai  is  constrained  to  lie  on  a  surface  with 
smaller  dimension  than  the  space,  e  reduced- order 
system  may  be  obtained.  Often,  the  resulting 
reduced-order  modal  has  many  properties  such  as 
robustness  and  invariance  to  disturbance  which 
makes  it  attractive  to  control  system  designers 
IS]. 

In  a  physical  system  with  discontinuous 
control,  a  representative  point  does  not  actually 
travel  along  the  switching  curve,  rather,  it 
"chatters"  along  the  curve.  The  chatter  is  caused 
because  an  actuator  cannot  switch  Instantaneously. 


It  may  switch  with  a  time-lag  or  may  act  as  a 
first-order  filter  so  that  a  representative  point 
actually  crosses  the  switching  boundary  into  the 
other  side  before  the  control  switches  to  direct  it 
back  again.  In  systems  which  are  linear  with 
respect  to  the  control,  the  limiting  behavior  of 
chattering  as  the  time-lag  goes  to  zero  is  the 
sliding  mode  where  the  switching  frequency  goes  to 
infinity  [5] .  For  the  purposes  of  this  paper,  the 
time-lag  for  switching  is  assumed  to  be  of  order 
0(c)  where  c«u.  In  this  way,  the  actuator 
dynamics  are  much  faster  than  the  fast  system 
dynamics.  (If  this  was  not  the  case,  then  the 
original  model  (i)-(2)  would  be  inadequate.)  Thus, 
the  system  displays  three  time-scales,  two  of  which 
(t  and  x)  are  of  interest.  Therefore,  setting  e-0 
yields  the  ideal  sliding  mode  equations. 

The  previous  theorems  proving  the  that  the  slow 
and  fast  models  approximate  the  actual  singularly 
perturbed  system  with  quantized  control  are  not 
applicable  when  sliding  occurs.  These  proofs 
relied  on  the  time  delay  between  boundary  crossings 
of  the  actual  solution  and  of  the  approximated 
solution  to  be  of  order  0(u).  Each  solution  then 
spent  a  nonzero  length  of  time  in  a  particular 
region  where  the  linearity  properties  kept  the 
approximation  error  to  be  of  order  0(u).  When 
sliding  occurs,  the  consecutive  time  spent  in  any 
one  region  is  zero  and  the  number  of  boundary 
crossings  in  any  finite  time  interval  is  infinite. 
Therefore,  the  phenomenon  of  sliding  must  be 
handled  separately. 

The  proof  of  the  following  theorem  shows  that 
if  sliding  occurs  in  tha  fast  time-scale,  then  the 
approximation  error  remains  of  order  0(u).  The 
case  of  sliding  in  the  normal  time-scale  will 
follow  as  a  consequence  of  this. 

Theorem  3:  Given  the  system  in  ( 5)-(6) , ( 10) - 

(11)  and  the  approximation  in  (12)  where  the  vector 
fields  on  each  side  of  a  switching  boundary 
intersect  the  boundary  with  an  angle  of  0(u°),  if 
either  of  the  systems  is  sliding  along  the  boundary 
and  if  K2B2  is  invertible  then  the  approximation 
error,  z(x)-z(x),  is  of  order  0(u), 

Proof:  It  is  clear  from  the  proofs  of  Theorem 

1  and  2  that  if  either  the  approximation  or  the 
actual  system  is  sliding  then  the  other  system  must 
also  be  sliding.  Hence,  it  suffices  to  show  that 
the  solutions  of  the  sliding  modes  of  the  two 
systems  differ  by  0(u).  The  method  of  equivalent 
control  [5]  will  be  used  to  find  the  sliding  modes 
of  the  systems.  Let  the  sliding  surface  for  the 
approximation  be  given  by  swKlx,+K2z-di»0.  If  the 
system  is  sliding,  then  ds/dx  ■  0. 

ft  "  *£  “  °  (l7> 

The  substitution  of  (12)  into  (17)  yields 

K2(A2x  +  B2u-q)  -  0 

where  the  equivelent  control,  u<q,  can  be  solved  as 

“eq  •  -(I2B2)  *K2A2z  (18) 

We  now  substitute  u-{j  for.  u  in  (12)  to  obtain  the 
sliding  mode  equations: 
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-  0 


(19) 
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d x 
At 

g*  -  (Aj  -  BjfKjB^'^jAj)*  (20) 

with  the  constraint  that  KjXj+KjZ-dj-O. 

The  sliding  aode  of  the  actual  system  sliding 
on  the  surface  saKtx+K2£  may  be  similarly  obtained 
using  - (5)-(6)  and  (10)-(11).  The  equivalent 
control  is  found  to  be 

ueq  “  +  uX^o )'l(XjAj*  +  tilCjA.x). 

The  substitution  of  ueq  for  u  in  (10)  and  (11) 
yields  the  sliding  mode  equations: 

gf  -  u(A0  -  liB0(tjBJ+tJC1B0  )*^XlAj  )x  (21) 
-  mBo(XjBj+uK1B0)‘1KjAj)£ 

g  -  -MBj(KjB3+uK1B0)'1K1A<1J  (22) 

+  (Aj  -  Bj (KjBj+uKiB, )_1K2Aj)i 

with  the  constraint  that  ^x+KjZ-d^O.  Note  that 
(21)  and  (22)  along  with  its  constraint  are  regular 
perturbations  of  the  sliding  mode  aquations  for  the 
approximate  system  (19)  and  (20)  with  its  con¬ 
straint.  Hence,  the  error  between  the  solutions  is 
of  order  0(u). 

A  sliding  mode  naturally  occurs  in  the  normal 
time-scale  every  time  a  boundary  hyperplane  is 
crossed.  Prior  to  a  boundary  crossing  in  the  t- 
time,  t*-®  sic**  ~'del  has  errors  of  order  C(^),  and 
the  quasi-steady-state  solution,  za,  is  given  by 
case  il)  of  the  definition  of  f  in  (13).  When  the 
solutions  x,  and  zs  hit  the  boundary,  the  condi¬ 
tions  for  using  case  iii)  are  satisfied  to  find  zs. 
As  mentioned  in  Section  2,  the  control  begins 
switching  rapidly  to  maintain  that  value  of  xs.  In 
essence,  the  system  satisfies  the  requirements  for 
sliding  in  the  r-time  but  has  reached  the  quasi¬ 
steady-state  solution  of  ss.  This  can  be  verified 
by  noting  that  the  value  of  z#  given  in  case  Hi) 
is  an  equilibrium  point  of  the  sliding  mode 
equation  in  the  fast  time  (20).  Note  that  since 
the  boundary  layer  solution  was  negligible  prior  to 
sliding  and  the  switching  occurs  very  quickly  (on 
order  of  0(c)),  the  boundary  layer  solution  remains 
of  order  0(u).  Since  zs  is  continuous  with  respect 
to  xs,  the  conditions  of  case  iii)  in  (13)  are 
satisfied  for  a  nonzero  length  of  time  in  the  t- 
ttme.  Hence,  the  system  must  slide  in  the  normal 
time-scale. 


Theorems  1  and  2  ere  restrictive  in  their 
application  due  to  the  requirement  that  the  vector 
field  cannot  cross  a  boundary  tangentially.  The 
following  theorem  for  continuous  systems  provides  a 
condition  for  the  accuracy  of  the  approximation 
that  is  easier  to  use  and  removes  the  above 
restriction.  First,  the  system  is  reformulated  and 
two  preliminary  results  are  given. 

Let  (3)  and  (4)  be  written  as 

x  ■  Al2x  +  Al2z  +  g2(x,z)  (25) 

uz  *  AjjX  +  A2jZ  +  gj(x,z)  (26) 

where  A 2 2®A2  2  ,  A2  2®A2  2  ,  A2  2®A2  2  ,  A2  2*A 2 2  ( the 

parameter  matrices  in  the  i«0  region),  and  gjfx.z), 
g2(x,z)  are  piecewise-linear  functions  defined  in 
region  i  as 

f1(x,z)  •  (Ajj*  -  All,)x  +  (Ajj1  -  A22°)z  +  Wjl 
g2(x,z)  ■  (Ajj1  -  A2l“)x  +  (Ajj1  -  A22°)z  +  Wji 

Lemma  1 :  If  A22  in  equation  (26)  is  stable, 
then  there  exists  real  positive  numbers  K  and  o 
such  that  le*JJtl  i  Ke*°  .  For  proof,  see  [6]. 

Laiana  2:  If  a  function,  g2(x,z),  is  piecewise- 
linear,  then  it  satisfies  a  Lipshitz  condition, 
l.e.,  there  exists  a  positive  real  number  k  such 
that 

■  gj(*.*)-*2(*.*>l  S  [*]  *  ||]  | 

For  proof,  see  [7], 

We  now  state  the  main  result: 

Theorem  4:  For  a  continuous  piecewise-linear 

singularly  perturbed  system,  the  error  in  the 
approximation  given  by  the  fast  model  (8)  is  of 
order  0(u)  if  k,X  and  o  as  defined  above  satisfy 
kKSo. 

Proof:  The  actual  system  is  given  in  the  fast 

time-scale,  i"(t-t0)/u,  by 

“  uAjjX  +  mA12z  +  Mg2(x,z) 

df  -  A21x  +  A„z  ♦  g,(x,z) 

(For  simplicity  of  notation,  z  is  denoted  as  z  and 
x  as  x.)  The  fast  model  approximation  can  be 
similarly  given  by 


The  sliding  mode  in  the  t-time  is  found  from 
the  quasl-steady-state  equivalent  control  of  the 
fast  system.  Replace  s  with  s,  in  (18)  and 
substitute  In  (18)  for  u  in  (14)  to  yield 

*,  -  A,x,  -  Ba(K,B,)’1K,A,sa  (23) 

Since  the  solutions  lie  on  a  boundary,  ta  as 
defined  in  case  ill)  of  (13)  may  be  substituted 
into  (23).  The  resulting  equation  is  the  sliding 
mode  in  the  normal  time-scale. 


3^  -  Aj j*  +  Ajj*  ♦  g2(x,z) 

Let  ej('t)  •  x(i)-x(t)  and  e2(t)  ■  t(t)-£(x),  then 
the  following  differential  equation  can  be  written. 

FS*i/d‘ll  .  f0  0  lf*il  +  ru(Ajlx+Al2z+g1(x,t))l 

[dej/dtJ  Laji  L8j(*.*>  *  «»(*.*)  J 


-  a,x,  ♦  *  a*dkn 

r,A,-iB, 

valid  on  the  RjXjHCjS^wdj+i  surface. 


The  following  solutions  are  obtained. 

ei(t)  -  ex(0)  +  u  j^(Allx(s)+A1Is(s)4gl(x,a))ds 


(24) 


•2(x)  -  -Al2'l(I-eAl*X)A2l*x(0>  +  •A»%1(0) 

+  uj"* -Aj,*l(I-*Al,X)Ail(Ailx(*)+A1,z(*)+«l(*,*))<J* 

♦  •A*j(,'*)(ga(x.«)  -  g2(x,z))ds. 

It  is  given  Chat  «1(0)  and  e2(0)  ar«  of  ordar  0(u). 
For  finita  x  and  boundad  parameters,  a1(x)«a1(0)+ 
0(u)“Oin).  Similarly,  the  first  integral  in  the 
second  expression  is  0(u)-  Using  Lama  1)  from 
above,  it  can  be  shown  that 

le2(x)l  S  Ka  cr*l«2 (0)1  + 

kJ*  e"°^x",^lgj(x,z)-gj(x,z)lds  +  0(u). 

From  I 2)  and  the  fact  that 

|*;|  s  '*i«  +  «•»«• 

the  following  expression  can  be  obtained: 

e^lejCx)!  S  Kl«](0)l  +•  UcP eos( le1l+lej  1  Ids  +  0(u) 

*  0 

This  is  reduced  using  the  results  from  [6]  to 
e^le^x)!  S  [Kle2(0)l  +■  0(u)]ekKx, 
which  finally  yields 

i  KI*J(0)le'(o‘kK)x  +  O(u). 

Hence,  if  kKSo,  then  te2(x)l  is  of  order  0(u). 


dx  ■  f 1(x,z)dt  +  gxdW,  x(t,)"x,  (27) 

udz  ■  fai(x)dt  ♦  fjj(x)zdt  +  v'ugjdW,  z(t,)»z,  (28) 

where:  x  and  z  are  scalar  variables;  W  represents  a 
Wiener  process  with  variance  parameter  Q;  gx  and  g. 
are  constants;  and  f2,  f2l  and  f22  are  continuous 
and  piecewise-linear.  As  in  the  deterministic 
case,  the  state  space  is  partitioned  into  regions 
where  the  functions  are  affine.  Note  that  the 
noise  input  to  the  second  equation  is  scaled  to 
preserve  the  well-posedness  of  the  fast  time  * 
problem.  See  [10]  for  a  discussion  of  this  problem 
in  linear  singularly  perturbed  systems. 

The  behavior  of  the  system  in  the  t  time-scale 
is  evaluated  by  expressing  (27)  and  (28)  as 

dx  ■  pf1(x,z)dx  +  i/ugjdW,  x(O)-x0  (29) 

dz  ■  f2l(x)dx  +  f22(x)zdx  +  gjdW,  z(0)-z0  (30) 

where  V  is  now  defined  as  a  Wiener  process  in  the 
x-time.  It  is  shown  below  that  this  system  may  be 
approximated  by  a  reduced-order  (fast)  model  of  the 
following  form: 

dz  ■  f2l(x,)dx  +  f 22(x0 )zdx  +  g2dW,  z(0)»z„  (31) 
Note  that  this  system  is  completely  linear. 

The  analysis  of  the  fast  system  focuses  on  the 
propagations  of  the  conditional  Joint  probability 
density  function  and  its  corresponding  character¬ 
istic  function.  Define  the  conditional  joint 
probability  density  function  as  p(x,z,x|x„ ,z„ ,0) 
and  the  characteristic  function  as 


The  application  of  this  theorem  requires 
obtaining  values  for  k,  K  and  o.  Results  from  (7] 
can  be  used  to  find  a  minimum  value  for  k. 

k  ■  maxi A2 2  *-A2 2 ,  A22^-A22l 
i 

where  IAI  is  the  maximum  singular  value  of  A.  The 
following  three  methods  may  be  used  to  find  values 
for  K  and  o  given  leATl  S  Ke*'”  where  A  is  stable. 

1)  If  A  is  diagonalizable  to  A  such  that  A“M"^AM, 
chen 

leAxl  S  IM*lIIM||eAx|  S  1H*1 1  IHIe"*” 

where  -o  is  the  lsrgsst  real  part  of  any  eigenvalue 
and  K-IM~1||MI. 

2)  Reference  [8]  shows  how  to  obtain  the  following 

values .  r  * 

Let  0(A)  “  max  |Xj^A+Aj|,  where  Xj  is  an  eigenvalue 
than  o»-0(A)  and  I”1 . 

3)  Ut  B-TAT*1,  then  K-ITIIT'1!  and  o— 0(B)  {9]. 

Thus,  Theorem  4  may  be  applied  without  prior 
knowledge  of  the  solution. 

5.  RANDOM  INPUTS 

The  effect  of  a  random  input  on  the  singularly 
perturbed  continuous  plecevise-linaar  model  is  now 
considered.  The  stochastic  modal  may  be  repre¬ 
sented  by  the  following  form. 


*(w,v,x|x0,z0,0)  -  jj  eJ(wx+vz)p(xi z , x | x„ ,z„ ,0)dxdz 


where 

p(x,z,x|x0,z0,0) 


J  *  j ( wx+vz )#(Wiv,x|x0,z0,0 )dvdv 


Because  the  system  is  piecewise-smooth,  a  Fokker- 
Plank  equation  which  holds  almost  everywhere  may  be 
derived  to  obtain  the  conditional  density  function. 

lx  “  "  + 

|g2lQ  |j2  p  +  0(uh  a.e.  (32) 

where  p»p(x,z,x |x„ ,z, ,0) .  The  propagation  of  the 
characteristic  function  is  derived  from  (32)  as 

-  H  e  ((f2l(x)+f jj(x)z)p]dxdz 


+  2*s*^  JJ  -J(W*+V*)  i-jpdxjj  +  0(M»)  (33) 

It  is  shown  below  that  a  solution  to  this  equation 

is 

•(w,v,x|x(,z#,0)-e^w*o$(v,x|x#,z#,0)  +  O(ui)  (34) 
or,  equivalently. 
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p(x,z,t|xo,z0.0)-4(x-xo)p(z,i|X|,,zo,O)  +  O(n^) 

where  p(z,t|x,,zc,0)  is  tha  conditional  probability 
density  function  of  the  fast  model  (31)  and 
♦( v,t jx0 ,z0 ,0)  is  its  corresponding  characteristic 
function. 


The  Fokker-Planck  equation  for  the  fast  model 
(31)  is 


-  m  3_ 
3x  3z 


[(f21(xa)+f ;J(x0)z)p] 


+ 


a.e. 


(35) 


where  p  -  p(z,t | x„ ,z„ ,0) .  An  equation  for 
♦(v,x|x0 ,z0 ,0)  is  obtained  from  (35)  as 


f“  ^eJV*  |^[(fIl(x<1)+fjj(xs)z)p]dz 

+  2*2lQ  |"eJVZ  fjjP  dxdz  06) 
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6.  SUMMARY 

Previous  results  on  the  singular  perturbation 
of  piecewise- linear  systems  are  extended  in  this 
paper.  Sliding  mode  equations  are  developed  in 
both  the  normal  time-  and  the  fast  time-scales  for 
the  case  of  the  quantized  control.  It  is  found 
that  the  occurrence  of  a  sliding  mode  does  not 
affect  the  validity  of  the  time-scale  separation 
procedure  given  in  an  earlier  paper.  A  new, 
nongeometrlc  theorem  is  given  to  prove  that  the 
approximations  developed  previously  for  the 
continuous  dynamics  case  are  accurate  to  within  an 
error  of  order  0(m).  This  theorem  is  easy  to  apply 
and  is  less  restrictive  «.n  its  assumptions. 
However,  because  all  tha  theoraaw  provide  suffi¬ 
cient  but  not  necessary  conditions,  none  supercedes 
the  others.  Finally,  the  effect  of  a  random  input 
on  a  particular  continuous  piecewise- linear  system 
is  analysed.  A  reduced-order  model  approximating 
ibe  system  in  the  boundary  layer  is  developed. 
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ABSTRACT 

The  effect  of  random  inputs  on  a  continuous  piecewise-linear 
singularly  perturbed  system  is  investigated  in  this  paper. 
Reduced-order  models  are  developed  for  a  second-order  system  (one 
fast  and  one  slow  variable)  which  has  a  random  input.  It  is  shown 
that  the  solutions  of  the  reduced-order  models  approximate  the 
actual  solution  with  differences  in  probability  density  functions 
of  order  0(p2)  (in  a  distributional  sense).  For  the  special  case 
of  a  system  which  is  linear  in  the  fast  variable,  it  is  shown  that 
the  mean-squared  error  between  the  approximate  and  actual 
solutions  in  the  fast  time  scale  is  of  order  0(p).  An  outline  is 
provided  for  the  extension  of  the  results  to  the  vector  variable 
case . 


1.  INTRODUCTION 

A  system  inherently  possessing  both  fast  and  slow  dynamics 
can  often  be  simplified  by  using  singular  perturbation  theory  to 
separate  the  system  into  reduced-order  models,  one  containing  the 
fast  dynamics  and  one  containing  the  slow  dynamics.  The  standard 
theory,  however,  is  restricted  to  systems  with  smooth  dynamics  [1- 
3).  Recently,  this  theory  has  been  extended  for  deterministic 
systems  to  piecewise-linear  systems  [4,5].  Piecewise-linear 


singularly  perturbed  systems  appear  in  many  applications  including 
flight  controls  and  electrical  circuits.  The  piecewise-linearity 
may  occur  from  a  piecewise-linear  element  such  as  a  saturation  or 
dead  zone  or  may  occur  as  a  result  of  a  piecewise-linear 
approximation  of  a  nonlinear  system.  It  is  desireable  to  extend 
singular  perturbation  theory  to  piecewise-linear  systems  with 
random  inputs. 

Reduced-order  models  for  linear  singularly  perturbed  systems 
with  Gaussian  random  input  have  been  developed  in  [6,7]  for 
filtering,  smoothing  and  control  purposes.  The  filtering  problem 
for  smooth  singularly  perturbed  nonlinear  systems  with  wide-sense 
stationary  random  input  is  discussed  in  [8].  Reduced-order 
filters  are  designed  for  the  smooth  nonlinear  system  corresponding 
to  the  fast  and  slow  dynamics.  This  paper  extends  the  previous 
work  on  singular  perturbation  theory  in  piecewise-linear  systems 
to  the  case  of  random  inputs  for  possible  use  in  filtering, 
smoothing  and  stochastic  control.  An  example  of  this  application 
is  a  singularly  perturbed  piecewise-linear  flight  control  system 
which  has  random  wind  disturbances. 

1.1  Problem  Formulation 

The  system  investigated  in  this  paper  is  continuous  and 
piecewise-linear  with  a  random  input.  Since  the  resulting  system 
is  nonlinear,  the  model  is  written  in  terms  of  coupled  Ito 
differential  equations: 


dx  =  fj(x,z)dt  +  gjdW 

(1) 

udz  =  f2(x,z)dt  +  Vdg2^w 

(2) 

where:  x  and  z  are  scalar  variables;  W  represents  a  Wiener  process 
with  variance  parameter  Q;  gj  and  g£  are  constants;  fj  and  f2  are 
continuous  and  piecewise-linear  functions.  As  in  the 
deterministic  case  studied  in  [4],  the  state  space  is  partitioned 
into  regions  where  the  functions  are  affine.  Let  the  system  be 


(3) 


defined  in  the  region  as  follows: 

dx  =  A^j^xdt  +  A^^zdt  +  gidW 
udz  =  A22'*'xdt  +  A22^zdt  +  Vug2<3W  C4) 

The  superscripts  simply  denote  the  region  number.  For  simplicity, 
the  regions  are  restricted  to  be  nonoverlapping,  nonempty  and 
parallel.  By  parallel,  it  is  meant  that  the  boundaries  of  the 
regions  are  parallel  hyperplanes. 

The  random  input  to  the  fast  subsystem  is  assumed  to  be 
scaled  by  so  that  the  well-posedness  of  the  problem  is 
preserved.  It  has  been  shown  by  Khalil  et.  al.  [9]  that  the  well- 
posedness  is  questionable  unless  the  white  noise  input  to  the  fast 
variable  is  scaled  by  a  factor  of  order  0(pa)  where  0 <a£|  or  wide¬ 
band  noise  is  used  instead.  The  problem  occurs  with  unsealed 
white  noise  because  as  p-*0  the  bandwidth  of  the  fast  subsystem 
approaches  infinitity,  so  that  the  fast  variable  acts  like  white 
noise  in  the  normal  time-scale.  This  is  valid  as  an  input  to  the 
slow  model  but  not  as  a  dynamic  process  itself.  Scaled  random 
inputs,  however,  do  not  exhibit  this  problem. 

The  outline  of  the  paper  is  given  as  follows.  In  Section  2, 
the  behavior  of  the  system  in  the  fast  time  scale  is  discussed. 
Aiso,  a  reduced-order  model  is  developed  which  is  valid  in  the 
fast-time  scale.  Similarly,  a  reduced-order  model  is  developed  in 
Section  3  to  approximate  the  slow  dynamics  of  the  system  with 
respect  to  the  nonrial  time-scale.  The  extension  of  the  second- 
order  analysis  to  higher  order  systems  is  outlined  in  Section  4. 
Concluding  remarks  are  included  in  Section  5. 

2.  FAST  SUBSYSTEM 

The  behavior  of  the  system  in  the  fast  time-scale  is 
investigated  below.  At  each  sample  time,  t^,  of  the  normal  time 
scale,  the  fast  subsystem  may  be  evaluated.  Define  the  expanded 
time  variable  by  x*(t-t^)/p  and  restrict  the  samples  so  that 


is  large  relative  to  y.  The  original  system  in  ( 1 ) - C 2 )  is 
reformulated  in  terms  of  x  as  follows 


dx  =  yf i (x , z  )dx  +  VugidW; 

XI 

o 
^ — ' 

II 

X 

o 

(5) 

dz  =  f2(x,z)dx  +  g2dW; 

z(0)  =  z0 

(6) 

where  Vi  is  now  defined  as  a  Wiener  process  in  the  x 

time-scale 

given  as  W(x)=y^W( yx) .  It  is  shown 

that  this  system  may  be 

approximated  by  the  solution  to  a  reduced 

the  following  form: 

-order  (fast) 

model  of 

dz  =  f 2(x0 , z )dx  +  g2dW; 

z(0)  =  z0 

(7) 

x(x)  =  x0 

The  proof  that  the  resulting  approximation  error  is  of  order 
0(y)  focuses  on  the  propagations  of  the  conditional  joint 
probability  density  function  and  its  corresponding  characteristic 
function.  Define  the  conditional  joint  probability  density 
function  for  the  solution  of  (5)-(6)  as  p(x,z;x|x0 ,z0 ;0)  and  the 
characteristic  function  as 

+<*> 

4>(v,w;x|x0  ,z0  ;0)  =  JJ  ej(vx+wz)  p(x, z ; x |x0 , z0 ; 0)dxdz  (8) 

-  CO 

where 

-GO 

p(x,z;x|xo,z0;0)  =  ^~2  JJ  e  ^VX+WZ^4>(v,w;x|x0  ,z0  ;0)dwdv  (9) 

—  GO 

Because  the  system  is  continuous  and  piecewise-smooth,  a 
Foxker-Planck  equation  which  holds  almost  everywhere  may  be 
derived  to  obtain  the  conditional  joint  probability  density 
function: 


If  =  -  [f2(x,2)p]  +  7  822q|j!p  -  v  |j  + 


(10) 


1  »!*«  £’  +  *  -* 


2  M2gl82Q  3x3z  P  +  o 


2  M^g^Q  azax 


a .  e . 


where  p  =  p(x , z ; x | x0 , z0 ; 0) .  The  equation  holds  everywhere  except 
for  the  set  of  measure  zero  where  the  dervivative  of  f^  and  f?  do 
not  exist.  The  initial  condition  (in  a  distributional  sense)  and 
auxilary  conditions  are 


p(x,z;0jxo,zQ;0)  =  6( x-x0 )6(z-z0 )  ; 


p  1  0  and 


II p 


dxdz  =  1 


(11) 


(For  a  discussion  of  the  derivation  of  the  Fokker-Planck  equation, 
see  Wong  111]-) 

Examination  of  (10)  shows  that  the  propagation  of  p  is 
relatively  insensitive  to  the  variation  of  x.  Since  this  is  a 
linear  partial  differential  equation,  the  methods  of  Kato  [12]  can 
be  used  to  show  that  the  solution  of  (10)  can  be  approximated  by 
the  solution  of  the  following  equation  with  errors  in  the  solution 
of  order  O(’jz)  ( in  a  distributional  sense). 


^£a  = 
3x 


a_ 

9z 


[f2(x,z)pa] 


+  2  822Q  dz2?a 


1 


a. e .  ( 12 ) 


where  pa=pa(x,z;x|x0 ,z0 ;0).  The  initial  and  auxilary  conditions 
remain  the  sar*  (again,  the  initial  conditions  are  defined  in  a 
distributional  sense): 


pa(x,z;0|x0,z0;0)  =  6(x-x0  )6( z-z0 )  ; 


JJ  Pa  dxdz  =  1 


(12 


\ 

f 


pa  £  U  and 


Hence,  p  =  pa  +  O(g^)  in  distribution. 

To  remove  the  consideration  of  the  differential  equation 
being  defined  almost  everywhere,  the  propagation  of  the 
characteristic  function  is  introduced.  Denote  the  characteristic 
function  of  pa  as  $>a( v, w ; x  |  x0 ,  z0  ;  0 )  where  a  characteristic 
function  is  define  1  in  equation  (8).  Then  an  expression  yielding 
the  propagation  of  <t>a( v,w; x 1  x0  ,  z0  ;  0)  can  be  found  from  (12)  by 
first  multiplying  both  sides  of  the  equation  by  e3vx+3wz  and  then 
integrating  with  respect  to  x  and  z. 


■  ejvx+jwz  [-  [f2(x,z)na]  +  |  g22Q  |^2paj  dxdz  (14’ 


The  values  of  the  derivatives  can  be  assigned  arbitrarily  for 
points  in  the  set  of  measure  zero  where  the  derivative  is  not 
defined.  Since  the  righthand-side  of  (12)  multiplied  by  ejvx+Jwz 
differs  from  the  integrand  in  (14)  on  a  set  of  measure  zero,  the 
right-hand  side  of  (14)  is  equal  to  the  integration  with  respect 
to  x  and  z  of  the  right-hand  side  of  (12)  multiplied  by  ejvx+2wz 
(for  proof,  see  [13]).  The  corresponding  initial  condition  is 
<l>a(v,w;0|x0  ,z0  ;0)=eJvxoeJwzo  and  the  auxilary  conditions 
correspond  to  (13),  i.e.,  <J>a(0, 0 ; x | x0 , z0 ; 0)=1 . 

Similar  to  the  case  for  the  actual  solution,  a  Fokker-Planck 
equation  .an  be  derived  to  find  the  conditional  probability 
density  function  for  the  approximation  given  in  (7) 

• 

fx  =  "  U2^x°,z^  +  2  822(^  fz2^  a-e'  *'15^ 

where  p  =  p( z ; x | xr , z0 ; 0 ) .  This  is  subject  to  the  following 
initial  and  auxilary  conditions: 

+“> 

J  p  dz  *  1 


p( z ; x | x0 , z0 ; 0 )  *  6(z-z0);  p  &  0  and 


(16) 


Denote  the  characteristic  function  for  p  as  4>(w;x  |x0  ,z0  ;0) .  The 
propagation  equation  for  <t>( w; x  |  x0  ,  z„  ; 0)  is  found  from  (15)  to  be 

-  +“ 

e^WZ  [  '  fz  tf2(xo*z)P^  +  \  §22Q  h^]  dZ  (17) 

where  the  initial  condition  is  $( w; 0 | x0 , z0 ; 0 )=el wzo . 

A  comment  can  be  made  about  the  steady-state  value  of  p.  It 
is  assumed  that  the  system  in  (7)  is  stable  so  that  a  steady-state 
solution  for  p  exists.  It  can  be  found  by  setting  the  time 
derivative  to  zero  in  the  Fokker-Planck  equation.  The  steady- 
state  can  then  be  solved  from  the  resulting  equation: 

[f2(x0,z)pj  =  |  g22Q  p^  a.e.  (18) 


94> 

9x 


where  p^  =  Pa,(z|x0).  Note  that  p,*,  can  be  considered  as  a  function 
of  x0,  but  continuity  of  that  function  is  not  guaranteed. 

It  can  now  be  shown  that  the  joint  probability  density 
function  given  by  the  solution  to  (12)  is  equal  to  pa=6(x-x0)p, 
or,  equivalently,  <t>a=eJvx°<t>.  The  expressions  for  pa  and  <ta  are 
substituted  into  (14)  to  yield 


9S> 

9x 


eJvxo 


J'  ejvx+jwz 


(  f  2 ( x , z )6( x-x0 )p  ] 


+ 


2  g22Q 


9z2 


[6(x-x0 


dxdz 


(19) 


Integration  with  respect  to  x  yields 


9®  ejvx0  =  ejvx0 
9x 


jwz 

■4° 


[-  [f2(x0,z)p]  +  \  g22Q  g2p]  dz  (20) 


Since  the  pair  (p ,<t>)  is  a  solution  to  (17)  it  must  also  be  a 


solution  to  (19).  Therefore,  pa=6(x-x0)p  and  $>a=e-ivx°<I>. 

Finally,  the  assertion  that  the  probability  density  function 
of  the  solution  to  the  approximate  model  differs  from  that  of  the 
true  solution  by  factor  of  order  O(m^)  is  proven  in  a 
distributional  sense.  Since  p=pa  +  0(p2)  (in  distribution),  the 
results  of  the  preceeding  paragraph  imply  that  p=6(x-x0)p  +  O(p^) 
(in  distribution).  Correspondingly,  +  0(u).  Hence,  the 
statistical  moments  of  the  true  solution  and  the  approximation 
differ  only  by  an  error  of  O(p^). 

2.1  Systems  Linear  in  z 

It  can  be  further  shown  that  for  a  system  which  is  linear  in 
z,  the  mean-squared  error  between  the  actual  solution  and  the 
approximate  solution  is  of  order  0(y).  A  continuous  piecewise- 
linear  system  that  is  linear  in  z  has  the  following  form: 


dx  =  ufji(x)dx  +  pA-^zclx  +  Ju  gjdW 

(21) 

dz  =  f2^(x)dx  +  A22zdt  +  g2^ 

(22) 

where  f-Q  and  f2\  are  continuous  piecewise-linear  functions;  A^. 
A22 *  and  g2  are  constants  and  W  is  a  Wiener  process  defined  in 
x  with  variance  parameter  Q.  This  is  simply  a  subset  of  systems 
of  the  general  form  given  in  (5)-(6).  Note  that  the  requirement 
that  Aj2  and  A22  be  constant  is  a  consequence  of  the  continuity  of 
the  system.  Also,  stability  of  the  fast  model  is  required  in  this 
analysis,  hence,  A22  is  stable.  The  process  is  ill-defined  if  A22 
is  not  stable. 

Examination  of  (21)  shows  that  x  stays  relatively  constant 
with  respect  to  x  and  can  be  approximated  by  x0.  The 
approximation  for  z  is  given  by  the  solution  to  the  following 
equation 


dz  =  f2i(x0)dx  +  A22zdt  +  g2^W 


(23) 


To  show  that  the  mean-squared  error  between  z  and  z  is  oi  order 
0(m),  define  the  approximation  error  as  <p(x)  =  z  -  z.  Then  an 
equation  for  <p  is  given  by 

d(p  =  (f2i(x)-f2i(x0))dT  +  A22<p<}t;  «p(  0 )  =  0  (24) 

The  solution  to  (24)  due  to  linearity  is  given  by 

x 

<p(x)  =  J  e  A22^x  °^[f?1(x)-f21(x0))  do  (25) 

o 

An  upper  bound  for  <p  can  be  found  by  noting  that  f 2 2  satisfies  a 
Lipschitz  condition;  hence,  there  exists  a  positive  constant  k<“ 
such  that: 


x 

<p(x)  S  J  eA22^x  °)  kllx-x0B  do  (26) 

0 

The  mean-squared  error  of  <p  is  found  from  (25)  to  have  an  upper 
bound  as  follows: 


E{cp2(x)}  2 


f  eA 22(x*o)  eA22(^-e) 
0  0 


k  E II  [  x  ( o  )  -  xo][x(0) 


x0l  dod©  (27) 


Since  A22  is  stable  and  the  quantity  x(x1)-x0  is  of  order  0(m)  for 
0Sx1Sx,  the  integrand  is  found  to  be  of  order  0(u)-  Hence, 
E{<p2(c  )}  is  of  order  0(u). 


3.  SLOW  SUBSYSTEM 

The  slow  dynamics  of  the  system  ( 1 ) - ( 2 )  can  be  approximated 
in  distribution  by  the  solution  of  the  following  model: 


dxs  =  f2(xs,zs)dt  +  g^dW ; 


(28) 


xs(t0)  =  x  Q 


0  =  f2(xs,zs) 

Note  that  zg  is  found  from  xs  using  the  Katzenelson  algorithm 
given  in  reference  [4].  This  algorithm  is  computationally 
efficient  for  solving  algebraic  piecewise-linear  expressions.  The 
approximation  is  validated  below  by  showing  that  the  true  joint 
probability  density  function  of  x  and  z  differs  from  that  of  xg 
and  zg  by  a  factor  of  O(p^)  (in  distribution).  This  approximation 
is  shown  to  be  valid  outside  of  the  initial  boundary  layer  as  long 
as  the  solution  does  not  cross  into  another  region  of  the  state 
space.  A  boundary  layer  may  need  to  be  evaluated  after  each  time 
the  solution  crosses  a  boundary  between  regions. 

The  Fokker-Planck  equation  yielding  the  joint  probability 
density  function  of  the  actual  solution  given  in  (l)-(2), 
p(x,z; t |x0 ,z0 ;t0) ,  can  be  derived  using  an  approach  similar  to  one 
found  in  111].  The  Chapman-Kolmogorov  equation  is  the  starting 
point . 


p(x, z ; t+A | x0 , z0 ; t )  = 


>  r 

p(x,z;t+A|x1,z1;t)p(x1>z1;t|x0,z0>t0)  dxjdzj  (29) 

J  J 


An  expression  for  p(x,z;t+A jxj ,zj ;t)  is  found  using  the 
characteristic  function.  Define  the  characteristic  function  as 


4>(v,w;t+A 


xl> 


gjvx+jwz 


p(x,z;t+A|x2»Zj ;t)  dxdz 


(30) 


where 


*00 


-jvx-jwz 

e 


p(x,z;t+A|x1,z1;t) 


4>(v,w;  t+A  |xj  ,zj  ;  t )  dvdw  (31) 


Expand  the  following  term  in  a  Taylor  series  about  x=x^. 


e  -^xl  x^  =  1  -  jv(xj-x)  +  j  (x^x)2  +  h.o.t.  (32) 

This  series  is  substituted  into  (30)  to  yield: 

+«> 

*  .  2 

$( v , w ; t+A  | X j , z j  ; t )  «  eJWZ  [  1 -jv(x1~x)+  ^  (xj-x)2  ] 

-  OO 

x  p(x,z;t+A|x^,z^;t)e',VXl  dxdz  (33) 

Integration  with  respect  to  x  yields 
®(v,w; t+A |xl » zl » t)  = 

+°° 

J  e^WZ  e^VXl  ri-jvfi(xi,z1)A  +  |  g12QA]p(z;t+A)xJ ,zj ;t) ]  dz  (34) 

—  oo 

To  solve  for  p(z ; t+A Jx: , z x ; t ) ,  a  boundary  layer  following  t 
may  be  evaluated.  Define  a  new  time  scale  by  x=A/u  and  let 
z(x)=z( ux+t) .  Then  it  is  found  using  the  derivation  in  Section  2 
that  an  O(u^)  approximation  for  the  conditional  probability 
density  function  of  i,  p( z ;x |xj ,zj ; t) ,  is  given  by  the  solution  to 
the  following  Fokker-Planck  equation. 

It  =  '  k  [f2(x!’z)P]  +  2  QS22  0  a-e-  (35) 

where  p=p( z ; x | xj , zj ; t )  and  p(z;0 | xj , Z\ ; 0)=6(z-z0 ) .  Assuming  that 
the  system  is  stable,  the  probability  density  function  reaches  a 
steady-state  in  x  denoted  as  p(z|xj).  For  a  small  value  of  A,  x 
stays  relatively  constant  so  that  p( z , t+A | x^ , z j , t )  is  approximated 
(in  distribution)  by 


pCz.t+Alxj  ,Z2‘,t)  =  pCzJxj)  +  0(m) 


(36) 


Restrictions  on  the  O(y)  term  arise  due  to  the  fact  that  both 
pCzit+Alxj.z^'.t)  and  plzjx^)  are  probability  density  functions  so 
they  must  satisfy  certain  conditions.  One  problem  with  this 
analysis  is  that  p(z\x^)  may  not  depend  continuously  on  for 
those  X}  which  lie  on  a  boundary  between  regions  in  the  state 
space.  If  p  is  not  continuous  with  repect  to  x^,  then  the 
conditional  moments  of  z  may  not  be  continuous  either.  The  fast 
dynamics  may  then  be  excited  sufficiently  so  that  the  slow  model 
approximation  is  not  valid  when  a  boundary  is  crossed.  Since 
proving  continuity  of  p  with  respect  to  Xj  may  be  difficult,  it 
may  suffice  for  many  practical  applications  to  show  only  that  the 
mean  and  covariance  of  z  are  continuous  as  a  function  of  x^  if  the 
higher  order  moments  are  negligible. 

Once  an  expression  in  (36)  is  obtained,  it  can  be  substituted 
into  (34)  and  the  resulting  expression  then  substituted  into  (31) 
to  yield  an  expression  for  p(x,z;t+A|xj ,zj ;t) : 


p(x,z;t+A|x1,z1;t) 


4^11  6  jVA  ejVXl+jwy[l-jvf1(x1,z1)A+  |  gx  2QA  ] 


*  p(y|xj)  dydvdw  +  0(y) 


This  is  then  substituted  into  the  Chapman-Kolmogorov  equation  (28) 
to  yield 


X 


Note  that  the  0(p)  term  is  placed  outside  of  the  integrals  since 
all  of  the  integrals  correspond  to  taking  expectation  or 
transformation.  The  expected  value  of  the  perturbed  quantity  is  a 
perturbation  of  the  expected  value  of  the  quantity. 

Since  the  integrand  in  equation  (38)  is  continuous,  the  order 
of  integration  may  be  interchanged.  Integration  with  respect  to  w 
yields 

p(x,z;t+A|x0 ,z0 ;t)  = 

6  jVX  ejVXl  tl-jvf1(x1,z1)A+  \  gx2QA] 

-oo 

*  p(y|x1)p(x1,z1;t|x0,z0;t)  6(z-y)  dydvdx^Z}  +  0(u)  (39) 

Integration  with  respect  to  y  yields  the  following: 
p( x , z ; t+A | x0 , z0 ; t )  = 

jJJ  \  e  jVX  ejVXl  [l-jvf1(x1,z1)A+  |  gjaQA] 

-OO  -oo 

x  p(z|x1)p(x1,z1;t|x0,z0;t)  dvdx^dzj  +  0(u)  (40) 

Integration  with  respect  to  Zj  yields 

+® 

«  •  .  2 
p( x ,  z ; t+A  |  x0  ,  z0  ;  t )  =  ^  e  eJVXl  [  1  - j vf  ^Xj  , z1  )A+  ^  g22QA] 

—  OO 

x  p(z|x1)p(x1;t|x0 ,z0 ;t)  dvdxj  +  0(p)  (41) 

where  z^  is  defined  by 


0  =  f2(x2,zj) 


(42) 


To  obtain  p(x;t+A|x0 ,z0 ;t) ,  integrate  (41)  with  respect  to  z  on 
both  sides  to  yield 


p(x;t+A|x0  ,z0  ;t)  =  e  ^VX  e^1  [  1  -  jvf  ^  (xj  ,  zj  )A+  j  g^QA) 


x  p(x^;tjx0jZ0;t)  dvdx^  +  0(u) 


(43) 


The  expression  in  (43)  is  evaluated  with  the  following  result: 


£ 

p(x;t+A|x0 ,z0 ;t)  =  p(x;t|x0 ,z0 ;t)  -  —  f  (x,z)p(x; t | x0 , z0 ; t )A 

1  a2 

+  ^  8j2Q  ^2p(x;tjx0,z0;t)A  +  0(u)  a.e.  (44) 

where  z  is  defined  by  0=f2(x,z).  As  A-*0,  this  expression  becomes 
the  usual  Fokker-Planck  equation  where  p~p(x, t |x0 , z0 ; t0 ) : 

|^  =  -  |^  [f1(x,z)p]  +  ~  g^2Q  |^f  +  0(M)  a.e.  (45) 


The  initial  condition  (in  a  distributional  sense)  and  auxiliary 
conditions  are 


p(x;t0 |x0,z0;t0)  =  6(x-x0);  p£0; 


pdx  =  1 


(46) 


The  Fokker-Planck  equation  of  the  slow  approximation  is  given 
by 


lEs  =  . 

9 1 


1 


(x 


s» 


zs)psl 


a.e.  (47) 


where  ps=ps( x ; t | x0 ; t0 )  and  ps(x;t0 |x0 ;t0 )=6(x-x0 )  (in  a 
distributional  sense).  Since  (45)  is  a  regular  perturbation  of 


(47)  satisfying  the  same  initial  and  auxilary  conditions,  the 
solutions  are  found  to  differ  by  0(p),  i.e., 

p(x;t|x0;t0)  =  ps(xs;t|x0;t0)  +  0(p)  (48) 

Hence,  for  statistical  purposes,  x  can  be  approximated  by  xg  with 
approximation  errors  in  the  probability  density  functions  of  order 
0(p)  in  distribution. 

4.  EXTENSION  TO  VECTOR  VARIABLES 

The  results  of  Sections  2  and  3  are  directly  extendable  to 
the  vector  variable  case.  The  fast  subsystem  approximation  is 
given  by  equation  (7)  where  z  and  x0  are  vectors  variables  (zcRr 
and  x0eRm)  and  f2  and  g2  are  vectors  of  appropriate  length. 
Similarly,  the  slow  subsystem  is  given  by  equation  (28)  where  xs 
and  zg  (xseRm  and  zeRr)  are  vector  variables  and  fj,  f2,  g^  are 
vector  valued  functions  of  appropriate  length. 

It  is  still  possible  to  show  that  the  errors  between  the 
probability  density  functions  of  the  fast  subsystem  and  the 
solution  are  of  order  O(p^).  The  Fokker-Planck  equation  for  the 
true  probability  density  function  is  generalized  from  the  previous 
case  as  follows: 


ip 

9x 


r  r 


■  '  1  *  5  j  l  <*2TQS2>i,j 

i=l  i=l  j  =  l  J 


'  M  £  [f1(x,z)P]  +  P  ^ 

i  =  l 


2  L  l  (8ilQgi)i.j 

i*l  j=l 


(49) 


m  r 


+  ^  2  l  I  (glTQg2)i,j  9x.3zj 

i=l  j=l 


+  \  l  I  <e2lQ8l>i.j 

i-1  J-l  J 


where  p=p(x , z ; x | x0 , z0 ; 0 )  and  x^  and  Zj  are  components  of  the  x  and 
5  vectors.  The  propagation  of  p  is  relatively  insensitive  to 
variation  ii  any  of  the  components  of  the  x  vector.  Hence, 
following  the  previous  analysis,  it  can  be  easily  seen  that  the 
probability  density  function  of  the  fast  subsystem  approximates 
that  of  the  true  solution.  For  systems  linear  in  z,  the  analysis 
in  Section  2.1  is  extended  in  a  straighf orward  manner. 

Most  of  the  analysis  in  Section  3  for  the  slow  subsystem 
involves  the  derivation  of  the  Fokker-Planck  equation.  Since  the 
Fokker-Planck  equation  has  been  derived  for  the  vector  case  in 
[11],  these  steps  generalize  accordingly.  The  only  difference 
between  the  analysis  in  Section  3  and  previous  work  is  the 
solution  for  p( z ; t+A | , zj ; t )  in  (34).  This  can  be  approximated 
by  the  steady-state  of  the  probability  density  function  for  the 
fast  subsystem  found  from  the  corresponding  Fokker-Planck 
equation . 

5.  SUMMARY 

Reduced-order  models  are  developed  in  this  paper  which 
approximate  the  original  system  both  in  the  fast  time  scale  and  in 
the  slow  time  scale.  It  is  shown  that  the  approximations  are 
valid  in  terms  of  the  statistical  information  cf  the  true 
solution.  It  is  further  shown  for  systems  that  are  linear  in  the 
fast  variable,  that  the  fast  subsystem  approximates  the  true 
solution  with  a  mean-squared  error  of  order  0(p). 
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Abstract 

In  th'S  paper,  the  analysis  for  a  singularly  perturbed 
linear  system  with  quantization  in  the  feedback  loop  is 
performed.  It  is  found  that  the  system  has  variable 
structure  and  can  exhibit  sliding  behavior  on  the  switching 
surfaces.  Because  the  system  is  nonsmooth  and  standard 
singular  perturbation  techniques  are  not  applicable,  a  new 
technique  is  developed  for  a  two-input  case  to  obtain  the 
boundary  layer  solution  and  the  outer  solution.  A  discus¬ 
sion  of  the  approximation  error  is  included.  The  tech¬ 
nique  developed  is  successfully  illustrated  on  a  numerical 
example. 


perturbation  analysis  performed  on  the  decoupled  coor¬ 
dinates.  For  the  purposes  of  this  paper,  it  is  also  assumed 
that  the  system  may  be  transformed  to  decoupled  coordi- 


represented  by: 

t  =  r  B„U, 

m  •  to 

(1) 

fU!  =  A 2r?  +  B2u, 

1(0)  =  % 

(2) 

where  feRp,  t)€Rr,  ueR2,  ^>0  is  small  and  A2  is  Hurwitz. 
Define  the  control  vector  to  be 


1.  Introduction 

Singular  perturbation  theory  is  an  asymptotic  ap¬ 
proximation  sch  me  used  to  simplify  systems  which 
contain  both  fast  and  slow  dynamics.  These  types  of 
systems,  termed  "numerically  stiff,"  are  often  difficult  to 
analyze  numerically  due  to  ill-conditioning  in  the  system 
matrices.  Singular  perturbation  theory  removes  the 
numerical  problem  by  separating  the  system  into  reduced- 
order  models,  one  containing  the  fast  dynamics  and  one 
containing  the  slow  dynamics.  This  theory  has  received 
considerable  attension  in  the  past  thirty  years  (see  the 
surveys  given  in  References  [1-3]).  However,  the  common 
restriction  placed  on  systems  for  using  singular  perturba¬ 
tion  theory  is  that  the  svstem  dynamics  must  be  smooth 
[1-3]. 

In  many  systems,  the  actuators  supply  inputs  with 
discrete  rather  than  continuous  values,  i.e.  the  input  is 
quantized.  Examples  of  these  types  of  actuators  include 
relays,  stepper  motors,  and  certain  types  of  hydraulic  and 
pneumatic  devices  [4,5].  The  resulting  control  is  discon¬ 
tinuous  with  respect  to  the  state  variable,  hence  the 
system  is  nonsmooth  and  standard  singular  perturbation 
techniques  are  not  applicable.  The  basic  theory  of 
singular  perturbation  is  extended  in  this  paper  to  the  case 
of  a  quantizer  exisung  in  a  two-dimensional  feedback 
loop.  The  scalar  case  was  developed  separately  in  Refer¬ 
ence  [6],  Note  that  the  discontinuous  control  causes  the 
system  to  be  a  variable  structure  system. 

Intuitively,  discontinuities  in  the  control  would  seem 
to  excite  the  fast  dynamics  in  the  same  wav  us  would  a 
step  input.  However,  it  is  found  that  under  very  mild 
restrictions,  the  slow  system  slides  along  the  switching 
surface  instead  of  cross. ;.g  through  it.  Therefore,  there 
are  no  jumps  in  the  quasi-steady-state  solution  which 
would  cause  the  fast  dynamics  to  respond  with  a  step 
response  outside  of  the  initial  boundary  layer. 

Li  Problem  Formulation 

The  system  under  consideration  is  assumed  '  be 
linear  and  time-invariant.  It  was  determined  in  [6]  that, 
for  the  scalar  quant' zed  control  case,  the  svstem  may  be 
transformed  into  decoupled  coordinates  and  the  singular 


_  f  q^-K^-K,^)  | 

.  qa(-K2ie-K22r7)  j 


(3) 


where  K,,,  K12,  Kj,  and  Kj2  are  row  vectors  and  q,  and 
qj  are  quantizer  functions  defined  as  follows. 


q,(x)  =  c,  ,  for  d,  ,  s  x  <  d,  i»l n  (4) 

q2(x)  =  for  d2 \  s  x  <  d2j„;  j  =  1, — k 


The  parameters  are  specified  such  that 

CZ,i<CZ,i-l’  d1,l<d1,i»1>  d2,I<d2,i-i>  d1,l  = 
dl,n.1  =  d2.k.1  =  ‘  «• 


This  system  is  a  variable  structure  system  with  nk 
possible  linear  subsystems.  The  state  space  (R?*r)  can  be 
partitioned  into  nk  nonoverlapping  regions  defined  by 


b,j  =  {(e,b):  q1(-K1Te-K,2n)  =  c1  ,  and 

q2(-K2ie-K22P)  =  c2.J}  (5) 


The  boundary  between  two  regions  is  a  convex  partition 
of  a  hyperplane  defined  by  =  d,  ,  for  some  i 

or  -K2,£-K22p  =  d2  -  for  some  j.  Note  that  if  K^yfCj, 
and  K.2^Kj2  then  each  of  the  r.gions  (except  those  of  S,- 
Vj,  S(,  Vi,  Snj  Vj,  and  Sik  Vi)  is  bordered  by  two  sets  of 
parallel  hyperplanes. 

For  the  purpose  of  this  paper,  it  is  assumed  that  K,2 
and  K?2  are  not  of  order  O(m).  If  either  of  the  quantities 
was  ot  order  O(ti)  then  the  slow  manifold  would  be 
nearly  discontinuous.  In  particular,  the  quasi-steadv-state 
solution  would  be  a  discontinuous  function  of  the  slow 
variable.  Hence,  the  fast  dynamics  would  have  to  be 
evaluated  after  each  switch  in  the  corresponding  conmol 
component. 

Reduced-order  models  of  the  system  described  above 
are  developed  in  this  paper  using  a  singular  perturbation 
approach.  The  boundary  layer  approx  '  anon  :s  given  in 
Section  2.  Section  3  presents  the  cater  solution  ap¬ 
proximation.  A  numerical  example  is  given  in  Section  4 
and  concluding  remarks  are  given  in  Section  a. 
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2.  Boundary  Layer  Solution 
The  fast  dynamics  are  most  prominent  during  the 
initial  boundary  layer  and  can  be  separated  from  the 
slow  dynamics  by  the  introduction  of  an  expanded  time- 
scale  r  =  (t-tg)/^-  It  can  be  easily  shown  that  £  stays 
relatively  constant  with  respect  to  r;  hence,  £  is  approxi¬ 
mated  by  £0.  The  approximation  for  rj  is  given  by  the 
solution  to  the  following  equation. 

^  =  AJ  +  B2uf;  ft(0)  =  n.  (6) 


u 


qiC-Ki^o-K,^) 

q,(-K„e0-K«*» 


This  reduced-order  model  is  also  a  variable  structure 
system  with  nk  possible  linear  subsystems.  Hence,  the 
reduced-order  state  space  (Rr)  can  be  partitioned  into  nk 
nonoverlapping  regions  for  which  the  system  is  linear. 
Analogously  with  the  full-order  model,  the  regions  Rjj 
are  defined  as: 


R(J.  =  {f):  q,(-K,1$0-K12f))  =  c,_i  and 

q2(-K21£0-K22f))  =  c2>]}  (7) 

Correspondingly,  the  boundaries  between  regions  are 
hyperplanes  defined  by  -K,,£0-K12i?  =  d,  f  for  some  i  or 
-Ka,€o-M  =  d2,j  for  some  J- 

It  is  assumed  that  the  system  in  (6)  is  asymptotically 
stable  to  one  equilibrium  point.  Stability  in  "ordinary" 
smooth  system  can  be  shown  by  use  of  Lyapunov’s  second 
method.  However,  in  variable  structure  systems  the 
Lyapunov  function  is  generally  discontinuous  and,  hence, 
not  everywhere  differentiable.  Paden  and  Sastry  intro¬ 
duced  a  generalized  Lyapunov  theorem  in  [7j  which  is 
suitable  for  discontinuous  functions.  Such  a  method  may 
be  useful  in  determining  asymptotic  stability  of  this 
system.  The  equilibrium  point  of  (6)  (also  defined  as 
quasi-steadv-state  solution  at  t =0,  n,(0))  is  derived  below 
and  a  discussion  of  the  approximation  error  follows. 


2.1  Evaluation  of  the  Equilibrium  Point 

There  are  three  basic  positions  for  the  equilibrium 
poin*  .£  (6):  in  the  interior  of  a  region,  on  single  bound¬ 
ary  hvperplane,  or  on  an  intersection  between  two  bound¬ 
ary  hyperplanes.  The  first  two  cases  are  treated  very 
similarly  to  the  scalar  quant  red  control  analysis  discussed 
in  Reference  [6].  The  last  case  is  more  complicated  and 
can  be  solved  uniquely  only  for  limited  types  of  systems. 
The  derivation  of  the  equilibrium  point  as  a  function  of 
K,,£0  and  K?2f0,  ft'K^K^fg),  is  shown  m  the  next 
three  subsections  for  the  three  possible  positions. 

2.1.1  Interior  Position.  The  piecewise-linearity  of  the 
system  in  (6)  is  utilized  in  determining  the  system  be¬ 
havior;  i.e.,  for  ft  in  the  R^  region  of  the  state  space,  the 
system  is  given  by  the  following  description. 


dr 
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C2.,  i 
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The  behavior  in  this  region  is  governed  by  the  position  o 

the  followng  point  which  is  termed  the  regional  equi 

lib  lum  point. 

,  r  1 

-1  r,  1  C,  ,  j  (Q 


V,, 


-A,  B, 


If  n lies  in  R^  (i.e.,  satisfies  (7)),  then  it  is  a  local 
equilibrium  point.  If  does  not  lie  in  R,^,  then  points 
in  R;j  are  directed  out  of  the  region. 

2.1.2  Single  Boundary  Position.  An  equilibrium  point 
may  lie  on  a  boundary  between  two  regions  if  trajectories 
from  the  two  bordering  regions  head  toward  the  boun¬ 
dary.  To  find  the  conditions  for  such  an  occurrence,  the 
specific  example  of  an  equilibrium  point  existing  on  the 
boundary  between  RSj  and  Rj.,  j  is  examined.  The 
control  u2  =  Cj  j  is  constar*.  arro::'±ir  bemdary,  b”*  u, 
switches  between  c,  j  and  c,  Suppose  the  following 
condition  is  satisfied. 

di,i.i  +  K^r,  *  -K^o  <  d1<M  +  (10) 

where  r;,.,  -  and  rj{  j  are  regional  equilibrium  points  for 
R,.,^  ana  R^  respectively.  This  condition  states  that 
each’ of  the  regional  equilibrium  points  of  the  two  border¬ 
ing  regions  lies  across  the  boundary  hyperplane  from  its 
associated  region.  Hence,  trajectories  in  Ri(1j  and  in 
R,j  move  toward  that  hyperplane.  If  the  regions  were 
parallel  then  this  would  be  a  sufficient  condition  for  the 
equilibrium  point  to  lie  on  that  boundary  hyperplane. 
However,  with  nonparaUel  regions  it  is  possible  for  the 
representative  point  to  move  (or  slide)  ailong  the  hyper¬ 
plane  until  it  reaches  a  position  where  the  hyperplane  no 
longer  borders  R(j  and  R|(,j.  Hence,  to  find  the 
equilibrium  point,  first  assume  that  the  regions  arc 
parallel  and  find  the  equilibrium  point  to  such  a  system. 
Then,  if  that  candidate  equilibrium  point  lies  on  the  part 
of  the  hyperpiane  that  borders  the  actual  regions,  it  is  a 
true  equilibrium  point. 

The  equivalent  control  method  which  was  developed 
for  use  in  variable  structure  systems  [8]  is  used  to  find  the 
candidate  equilibrium  point.  If  representative  points  from 
both  sides  of  the  boundary  are  directed  toward  the 
boundary,  then  the  control  u,  starts  switching  very  quickiy 
between  c,  (  and  c,  while  u2=c2  f  remains  constant 
The  system’filters  out  the  high  frequency  leaving  only  the 
low  freauenev  average  component.  The  average  value  of 
u,  as  t-  +  «  is  the  equivalent  control  at  the  equilibrium 
point,  u,.  The  corresponding  candidate  equilibrium  point, 
H,,  is  found  to  be: 


7,  = 

(K„£a  +  di  (.,)(t?,,-'?,.,.j)  (K.^.,  ,, 


Ki2<7,...rV 


K.2(7,.,.rnM)  (ID 


This  is  a  true  equilibrium  point  if  it  lies  on  the  pan 
of  the  hyperplane  that  borders  R^  and  R;.,  This 
corresponds  to  satisfying  the  following  condition: 


d2i  +  K22n,  i  -K2,£0  <  d2  +  K22hs  (17) 

Hence,  if  the  conditions  in  equation  (10)  and  (12)  are 
satisfied,  men  the  equilibrium  point  is  given  by  equation 
(11).  Tnese  conditions  are  illustrated  in  the  second-order 
example  given  in  Figure  1.  Note  that  the  equilibrium 
point  lies  on  the  intersection  of  the  line  connecting  rj 
and  rj,.,  :  and  the  hyperpiane. 

The  equilibrium  point  existing  on  a  boundary  betwe-  n 
R  and  R,  is  found  using  a  dual  argument.  The 
conditions  in  (10)  and  (12)  correspond  to  the  foilosvir 
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conditions,  respectively. 

d2, i*1  +  j  ^"^21^0  <  d2. j»1  +^22^i,j»1  an<^(^) 

d1i  +  ^12^*  *  -K,,<0  <  d1.t*1  +  ^12^» 

where  the  equilibrium  point  is  given  by: 


(KziCp  +  U;  j^i,  j.i)  +(^;2^i  ,j»l)^i  j  *  (^22^1  j)7i.j-l 

^22^1, j»1  ‘  7|j)  ^22^i. j-1  '  0^) 

Therefore,  if  the  conditions  in  (13)  and  (14)  are  satisfied, 
then  the  equilibrium  point  is  given  by  (15). 

2.13  Boundary  Intersection  Position.  Another 
possible  position  for  the  equilibrium  point  is  on  the 
intersection  between  two  boundary  hyperplanes.  To  find 
the  conditions  for  this  occurrence  it  is  known  that  neither 
the  conditions  for  (11)  or  for  (15)  to  be  the  equilibrium 
point  can  be  satisfied  since  it  is  assumed  that  there  exists 
only  one  equilibrium  point.  There  are  two  separate 
additional  conditions  on  the  system  each  of  which  yield  an 
equilibrium  point  on  the  intersection  of  boundaries. 

d1,i»l  +  ^12^1]  ^  -K„$o  <  dl,i»1  T  ^I2^i*1,j»1 

and  (16) 

d2,J»1  +  *  *^21^0  <  d2,jM  +  ^22^t,j«1 

or 

dl,i*t  +  ^12^1, j*1  *  *^li€o  <  d1,l*1  +  ^12*ft*1.J 

and  (17) 

d2,J*1  +  *  ’^21^0  <  d2,i»1  + 

A  second-order  example  of  a  system  satisfying  the  condi¬ 
tion  in  (16)  is  given  in  Figure  2. 

The  equilibrium  point  for  system  (6)  is  not,  in 
general,  found  uniquely  when  either  the  condition  in  (16) 
or  (17)  is  satisfied.  By  definition  of  the  quantizer  func¬ 
tion,  for  the  equilibrium  point  to  exist  on  the  intersection 
between  two  boundary  hyperpianes,  then  7,  must  be  a 
solution  to  the  following  equation. 

IS  0  =»  '  j^0  for  some  ij  (18) 

ra2,j*i  " 

For  a  second-order  system  with  an  invertible  [K12T,  K„T]T 
matrix,  a  unique  solution  for  7,  is  found  to  be  given  by 


KJ*1  -d,.,.,  -  K,,€0 

^22  ^'^2,  J*1  '  ^21^0 


If  it  can  be  determined  that  the  system  is  sliding  on 
the  intersection  of  the  hyperpianes,  which  is  the  surface 
defined  by  the  solution  rj  of  equation  (18),  then  a  unique 
solution  for  7,  may  exist  for  larger  order  systems.  Using 
notation  comm  to  variable  structure  system  theory, 
define  an  affine  i^.ictionai  s  as: 

s  *  -Kr?  +  v  (20) 

where  K  and  v  are  determined  from  (18)  such  that  the 
solution  of  s(n)  =  0  is  the  switching  surface.  A  sliding 
mode  exists  on  that  surface  if  sTs<0  (8].  The  equilibrium 
point  of  the  sliding  mode  can  be  found  by  first  transform 
ing  the  system  into  regular  form,  then,  finding  the  equi¬ 


librium  point  for  the  sliding  mode  equation  in  the  new 
coordinates  and,  finally,  transforming  the  equilibrium 
point  back  to  the  original  coordinates.  The  equilibrium 
point  of  a  sliding  mode  existing  on  the  intersection  of  the 
two  boundary  hyperpianes  is  given  below  (for  details  of 
the  derivation,  see  [9]). 

7,  =  -<S,  -  Sj(KS2)-1KS,)A,*1T,A2S2(KS2)-1v 

+  S2(K Sj)*1v  (21) 

where:  At  =  T,A2S,  +  T, A2S2(KS2)’*(-KS,),  T,  =  U2\ 
S,  =  U2,  S2  =  U,  and  U,  and  U2  are  obtained  from  a 
singular  value  decomposition  of  B2,  i.e.  B2  =  [U„  U2]XV\ 

2.1.4  Function  for  the  Equilibrium  Point.  In 
summary,  the  equilibrium  point  7,  for  the  system  in  (6) 
can  be  found  as  a  mapping  of  the  variables  Kn£„  and 
^21^0’  6,  =  f(K11(0,K2l$0),  from  equations  (9),  (11),  (15), 
(19),  and  (21)  depending  on  the  conditions  that  are  satis¬ 
fied.  The  mapping  is  a  function  (i.e..  single-valued)  since 
it  is  assumed  that  there  exists  an  unique  equilibrium 
point.  The  fitnetion  is  piecewise-linear  since  each  of  the 
function  definitions  is  linear.  Continuity  is  not  guaran¬ 
teed  and  may  need  to  be  evaluated  on  a  case  by  case 
basis.  However,  using  the  methods  in  [6]  for  the  scalar 
control  case,  it  is  straightforward  to  show  that  the  func¬ 
tion  is  continuous  for  a  second-order  system.  To  demon¬ 
strate  the  use  of  this  function,  an  example  is  given  below. 

Example:  Let  a  system  be  given  by 


f  4  O' 

[o  4]  U 


The  control  selected  is  bang-bang  with  components: 

u,  =  sgn  s,;  s,  =  -[1.  2]rj  -  0 

u2  =  sgn  s2;  s2  *  -[-2,  2]rj  -  7  (23) 

where  the  sgn  function  is  defined  below: 

j  1  if  s  >  0 

s^s=  (-1  if  s<0  (24) 

where  5,(77)  =  0  and  sz(7)  =  0  define  the  switching  hyper¬ 
pianes  and  0  and  7  are  parameters.  To  correspond  to  the 
previous  notation,  let 

K,2  =  [l.  2];  K«-f-2.2];  0  =  7  -  K,,^, 


-8  4 

.  0  -4  ’ 


4  0 

.0  4. 


C1,1  *  <=2,1  “  ’I.  c1,2  *  C7.Z  *  d1.2  *  d2.2  *  0 

Define  the  regions  as 

R„  ={77:  u,-  1  and  u-=-l)  =  {7:  5>-K,2t7  and  7>-K22n) 
R,j  =(77:  u,  =-l  and  u2=  1}  =  (7:  0>-K,27  and  rs-K227> 
R2,  -fp:  u,  =  1  and  u2=*-l}*{7:  0S-K,27  and  7>-K22n) 
R^  ={7:  u,  =  l  and  u2=»l)  =(7:  0S-K,27  and  7 S-K227} 

The  regional  equilibrium  points  a*e  found  to  be 

7||  a  [|Jj  :  =[?  ;  721  »_.?]  ;  r,tt  =  [|  (26) 
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The  equilibrium  point  can  be  found  as  a  function  of 
0  and  7  using  the  function  definitions  given  in  equations 
(9),  (11),  (15),  (19)  and  f 21  j.  For  a  regional  equilibrium 
point  to  lie  inside  its  associated  region,  one  of  the  follow¬ 
ing  conditions  must  hold: 


a) -if  0>3  and  j>0  then  p^eR,,  and 

b)  if  0>-2  and  75-2  then  n12€R12  and  r)s  =  7,2 

c)  if  0<2  and  7>2  then  n21tLR21  and  rjs  =  n21 

d)  if  0<-3  and  rsO  then  722eR22  and  '7,  =  '722 


Note  that  these  are  mutually  exclusive  conditions. 

For  an  equilibrium  point  to  lie  on  a  boundary,  either 
conditions  (10)  and  (12)  or  (13)  and  (14)  are  satisfied. 
The  following  conditions  and  corresponding  equilibrium 
points  are  given  for  to  lie  on  the  boundaries  between 
R,,  and  R21,  R12  and  R22,  R,,  and  R12,  and  R2,  and  R22, 
respectively. 


m 

[  : 

21 

M 

~T 

l*i] 

.f-i 

3lo 

+ 

g)  if  0>7>-2  and  3<0-7(5/2)  then  n,  =  7^  +  j 

h)  if  2>7>0  and  -3>0-7(5/2)  then  tj,  =  T^j  +  | 

If  the  equilibrium  point  of  (22)  lies  on  the  intersec¬ 
tion  between  s,  and  s2,  then  the  following  must  be  true. 


i)  if  the  conditions  in  a)-h)  are  not  satisfied  and 
3>0>-3  and  2>7>-2  then 


n 


'  -1/3  1/3 

’  0 

.  -1/3  -1/6 

.7  . 

The  domain  space  of  the  mapping  7,  =  f(0,7)  can  be 
partitioned  into  ten  nonoverlapping  regions  each  cor¬ 
responding  to  a  function  definition  a)-i).  It  can  be  shown 
easily  that  this  mapping  is  a  function  since  none  of  the 
regions  overlap.  Also,  the  function  is  continuous.  This 
can  be  shown  easily  by  noting  that  the  function  is  con¬ 
tinuous  within  each  partition  of  the  0-7  space.  It  is 
straightforward  to  show  that  on  any  boundary  between 
nvo  partitions  the  two  function  definitions  are  equal. 

12  Boundary  Laver  Approximation  Error 

The  errors  introduced  by  approximating  the  true 
solution  by  the  solution  of  (6)  are  due  entirely  to  the 
assumption  that  £=>£„  in  the  boundary  layer.  As  in  the 
other  piecewise-linear  systems  discussed  in  [6,10],  the 
approximation  errors  are  of  order  0(4)  for  the  time 
intervals  when  both  the  actual  solution  and  the  approxi¬ 
mate  solution  exist  within  the  same  region  of  the  R'’*1’ 
state  space  due  to  linearity  When  a  single  boundary 
hyperplane  is  crossed,  then  the  previous  results  on  the 
approximation  errors  in  the  scalar  quantized  control  case 
are  applicable.  That  is,  if  the  vector  field  does  not 
intersea  the  boundary  with  an  angle  of  order  0(4 ),  then 
the  approximation  error  remains  ■  •(  order  0(4).  How¬ 
ever,  if  the  solution  crosses  a  boundary  hyperplane  within 
an  0(4)  neighborhood  of  an  intersection  between  bound 
ary  hyperplanes,  this  result  cannot  be  used. 


There  are  certain  problems  introduced  by  allowing 
intersections  of  switching  boundaries  to  exist  in  the  system 
definition.  If  the  actual  solution  crosses  a  boundary  near 
an  interseaion,  then  the  approximation  may  not  cross  into 
the  same  region.  From  that  point,  there  is  no  guarantee 
that  the  approximation  error  remains  of  order  O(n).  A 
consolation  in  this  is  that  if  the  system  is  not  sliding  on 
switching  surface,  then  the  chances  of  hitting  a  boundary 
within  an  0(4)  neighborhood  of  the  intersection  for  an 
arbitrary  initial  condition  is  of  order  0(4).  The  exception 
to  this  is  when  the  equilibrium  point  Lies  on  the  intersec¬ 
tion.  In  that  case,  the  solution  must  eventually  travel  into 
the  0(4)  neighborhood  about  that  point  and  will  cross  the 
boundaries  there.  However,  if  the  function  f  which 
defines  the  equilibrium  point  is  continuous,  then  0(4) 
approximation  errors  in  £  result  in  0(4)  errors  between 
the  equilibrium  point,  pt,  and  the  actual  value  of  rj(r)  as 
Therefore,  if  the  solutions  differed  by  an  amount 
of  order  0(4)  prior  to  entering  the  small  neighborhood 
about  then  continuity  of  f  implies  that  the  error  will 
remain  of  order  0(4). 

3.  Outer  Solution 

The  outer  solution  is  found  by  neglecting  the  fast 
dynamics.  It  is  assumed  that  the  fast  variable,  17.  reaches 
a  quasi-steady-state  value  within  the  initial  boundary 
layer.  This  initial  quasi-steady-state  value  is,  of  course, 
the  equilibrium  point  of  the  fast  subsystem  (6).  The 
quasi-steady-state  solution,  pt(t),  cannot  be  found  by 
simply  setting  4  =  0  in  equation  (2)  as  is  done  in  standard 
singular  perturbation  techniques,  because  the  solution  n, 
to  the  resulting  equation  is  undefined  for  values  of 
(£,,)?,)  that  lie  on  a  switching  boundary  in  the  state  space. 
Instead,  nt,  is  found  as  an  equilibrium  point  of  the  fast 
subsystem  using  the  function  defined  in  Section  2.1,  i.e. 
h,(t)  =  fflC11*,(t).Kl,*,(t)). 

Similarly,  the  control  in  the  slow  time-scale,  ut, 
cannot  be  obtained  by  simply  substituting  £,  and  rjf  for  £ 
and  t)  in  the  original  definition  of  the  control  (3)  because 
the  control  is  undefined  for  values  of  (£*,17,)  that  lie  on 
a  switching  boundary.  In  ordinary  systems  with  discon¬ 
tinuous  control,  the  system  chatters  along  the  sliding 
surface  causing  the  control  to  switch  at  a  very  high 
frequency.  The  average  value  of  the  control  (i.e.  the 
equivalent  control)  determines  the  motion  of  the  system 
along  the  sliding  surface.  In  this  application,  however,  the 
system  does  not  chatter  as  seen  from  the  definition  of  r\%. 
Thus,  th-  ■  control  must  be  given  as  the  equivalent 
con-  -  jVise  values  of  (£,,t?s)  that  lie  on  a  switching 
bov...  ,  _  Alternately,  it  can  be  given  as  the  final  value 
of  the  equivalent  control  in  the  fast  time-scale. 

u,  =  -(B/B^VAjP,  (27) 

Note  that  the  use  of  the  psuedo-inverse  is  justified  by  the 
consistancy  of  the  equation,  i.e.,  was  derived  from  the 
equivalent  control. 

Thus,  the  solution  of  the  system  (l)-(2)  can  be 
approximated  outside  of  '.he  boundary  ’aver  by  the 
solution  to  the  following  system: 

£,  =  A0£f  +  B0u,;  £,(0)  =  f0  (2S. 

7.  »  f(K„£,.K21£t) 
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where  u  is  given  in  equation  (27)  and  the  function  f  is 
defined  from  (9),(ll),(l5),(19)  and  (21).  As  ctated  previ¬ 
ously,  f  is  piecewise-linear  and  may  be  continuous;  hence, 
the  slow  manifold  defined  by  the  function  f  is  piecewise- 
linear  and  may  be  continuous. 

The  continuity  of  the  slow  manifold  is  required  for 
the  approximation  errors  given  by  n(t)-r!,(t)  and  £(t)-£,(t) 
to  be  of  order  O(n)  for  the  time  outside  of  the  initial 
boundary- layer.  If  the  function  defining  the  slow  mani¬ 
fold  is  continuous,  it  satisfies  a  Lipschitz  condition  since 
it  is  piecewise-linear  [11].  Therefore,  a  sufficient  condi¬ 
tion  for  the  approximation  error  to  be  of  order  O(t0  is 
that  the  Lipschitz  constant  be  bounded  as  m-0.  i.e.  it 
cannot  be  of  order  0(l//i).  If  it  was  of  order  0(1 /m), 
then  the  equilibrium  point  for  the  fast  dynamics  would 
change  too  quickly  in  the  t-time  scale  thereby  invalidat¬ 
ing  the  separation  in  time  scales  between  t  and  r.  The 
resulting  behavior  would  require  evaluation  of  the  fast 
dynamics  after  each  switch  in  the  control,  and  the  slow 
model  approximation  in  (28)  would  be  valid  only  in  the 
time-intervals  between  switches. 

4.  Numerical  Example 

An  example  of  the  approximation  method  for  the 
two-input  quantized  control  system  is  demonstrated  here. 
The  system  is  given  in  the  form  of  equations  (l)-(2).  The 
control  is  selected  to  be  bang-bang  with  components 

u,  =  sgn  s1;  s,  =  -K,,£  -  K,2r?  (29) 

u*  =  sgn  Sj;  s2  =  -K21£  -  K nri 

The  parameter  matrices  are  given  as  follows: 


'-3  10]  0  O' 

A,*  -1  -3  0  B0  =  -10 

0  0  -2j  0  -1 


K„  =  [2  1  1];  K,2  =  [1  2]; 

K21  =  [1  1  2];  K22  =  [-2  2] 

To  correspond  to  the  previous  notation,  the  parameters 
of  tbe  quantizer  functions  are  defined  as; 

C1,1  3  c2,1  =  *1;  C1,2  =  <=2,2  3  !;  d1.2  3  d2,2  3  0 
The  initial  conditions  are  £0  =  [-1.5,  1,  -0.75]T  and  tj,,  * 
[1,  2]r  and  At  =  0.1. 

Note  that  the  fast  subsystem  is  the  same  as  given  in 
the  example  described  in  Section  2.1.4.  Therefore,  the 
boundary  layer  approximation  is  found  as  the  solution  of 
(22).  The  equilibrium  point  of  this  system,  7,(0),  is  found 
as  a  function  of  K,,£0  and  K2,£0  using  the  function 
definitions  a)-i)  listed  in  Section  2.1.4  where  0=  K,,£0 
and  7  =  K21£0.  With  the  given  initial  conditions  on  £,  the 
equilibrium  point  is  found  to  be  7,(0)  =  [-2.75,  -2]r.  This 
corresponds  to  an  equilibrium  point  existing  on  the 
boundary  between  R,2  and  R22  so  that  as  f)  approaches 
7,(0),  the  control  u,  begins  switching  very  rapidly  while 
u2=l  remains  constant.  It  can  be  shown  that  with  the 
given  initial  conditions  on  rj,  the  fast  subsystem  will  slide 
in  the  r-ume  scale  on  the  switching  surface  defined  by 
5,-0. 

The  outer  solution  is  found  from  the  slow  model  in 


(28).  The  quasi-steady-state  solution  is  found  from  the 
function  definitions  a)-i)  listed  in  Section  2.1.4  where 
$  =  K.,,£,  and  7  =  K21£,.  The  initial  conditions  for  this 
example  are  such  that  the  system  starts  out  sliding  on  the 
s,  =0  surface  in  the  normal  time-scale. 

Comparison  between  the  time-integration  of  the 
actual  system  and  the  approximate  system  are  shown  for 
representative  states  in  Figures  3-4.  The  errors  between 
the  trajectories  are  of  order  O(n).  In  the  approximate 
solution,  the  boundary  layer  correction  (f)-7,(0))  is  added 
to  the  outer  solution  for  0sts0.2,  beyond  which  it  is  negli¬ 
gible.  Both  the  approximation  and  the  true  solution  are 
asymptotically  stable  to  the  origin.  Therefore,  both 
systems  are  found  to  slide  on  the  intersection  of  the 
switching  surfaces  defined  by  s,=0  and  s2  =  0.  The 
computation  time  for  obtaining  the  actual  solution  was 
roughly  18  times  longer  than  that  for  obtaining  the 
approximate  solution  for.  As  in  all  singular  perturbation 
approaches,  as  m  decreases,  the  approximation  becomes 
more  accurate  and  the  relative  computational  time-savings 
greatly  increases. 

5.  Summary 

This  paper  presents  the  analysis  of  a  singularly 
perturbed  two-input  quantized  control  system.  The 
discontinuities  in  the  control  occur  in  the  state  space  on 
single  boundary  surfaces  as  well  as  on  intersections  of 
boundaries.  This  latter  occurrence  is  precisely  what 
makes  the  analysis  of  the  two-input  case  so  much  more 
complicated  than  that  of  the  scalar  case.  In  spite  of  the 
complications,  reduced-order  models  are  developed  which 
yield  the  outer  solution  and  the  boundary  layer  solution. 
As  part  of  the  slow  model,  a  function  is  derived  which 
solves  for  the  quasi-steady-state  solution  in  terms  of  the 
slow  variable.  This  function  is  known  to  be  continuous 
when  the  fast  subsystem  is  second-order.  Other  cases 
may  need  to  be  evaluated  numerically.  As  in  the  scalar 
case,  the  system  may  possess  a  sliding  mode  in  the  fast 
time  as  well  as  the  slew  time  scales.  The  results  of  a 
numerical  simulation  show  that  the  approximation  method 
described  in  this  paper  can  yield  very  accurate  results 
with  very  good  computational  time-savings. 

It  appears  that  the  extension  of  singular  perurbation 
theory  to  the  general  multiple  input  quantized  control  is 
a  very  complex  problem.  In  particular,  finding  a  function 
which  solves  for  the  quasi-steady-state  solution  is  very 
tedious  and,  in  fact,  may  not  be  possible.  Hence,  in 
general,  singular  perturbation  theory  does  not  simplify  the 
analysis.  However,  it  may  be  the  only  alternative  if  the 
original  system  is  too  numerically  stiff  to  be  solved  any 
other  way.  A  discussion  of  the  multiple  input  case  is 
contained  in  [9]. 
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Figure  1:  Graphical  example  of  condition  for  equilibrium 
point,  to  Lie  on  a  boundary  between  R,  j  and  R,.,  r 


0  00  0.50  too  150  ZOO 

Tim*  (a*c) 


Figure  3:  Response  of  to  initial  condition  for  actual 
system  (solid  line)  and  approximate  system. 


Figure  2:  Graphical  example  of  condition  for  equilibrium 
point,  rjt,  to  lie  on  intersection  of  boundaries. 


Tim*  (s*c) 


Figure  4-  Response  of  n2  to  initial  condition  for  actual 
system  (solid  line)  and  approximate  system. 


2183 


APPENDIX  J 


M.  A.  Ingram  and  A.  H.  Haddad,  "Optimal  and  Suboptimal  Filtering  for  Linear 
Systems  Driven  by  Self-Excited  Poisson  Processes",  Proc.  Annual  Allerton  Conference 
on  Communications.  Control,  and  Computing.  University  of  Illinois,  pp.  426-435, 


October  1987. 


OPTIMAL  AND  SDBOPTIMAL  FILTERING  To*  LINEAR  SYSTEMS  DRIVEN  BY  SZLP-EXC1TB) 
POISSON  PROCESSES 
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School  of  Electrical  Engineering 
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ABSTRACT 

Stochastic  differential  equations  for  the  conditional  density  function 
and  moments  are  presented  for  a  linear  system  which  is  excited  by  a  marked 
Poisson  process  whose  rate  depends  on  the  state  of  the  system  and  which  it 
observed  in  white  Gaussian  ..oise.  The  set  of  optimal  filtering  equations 
is  infinite  dimensional,  therefore,  any  practical  filter  is  suboptimal.  * 
suboptimal  filter  is  developed  for  the  case  of  unmarked  Poisson  excitation. 
This  suboptimal  filter  estimates  the  Poisson  process  via  a  combined  sequen¬ 
tial  estimation  and  detection  scheme  based  on  the  criterion  of  maximum 
a  posteriori  (MAP)  probability.  An  example  computation  is  presented. 


1.  INTRODUCTION 

This  paper  examines  the  issue  of  state  estimation  for  a  linear  system 
which  is  driven  by  a  Poisson  process  whose  rate  parameter  depends  on  tha 
state  of  the  system.  The  input  process  is  described  as  *self-excited* 
since  its  rate  function  can  be  specified  given  the  past  history  of  the 
input  process. 

The  model  of  a  dynamic  system  driven  by  a  Poisson  process  with  a  state 
dependent  rate  is  motivated  by  several  practical  situations.  In  aircraft 
maneuvers,  the  pilot's  discrete  application  of  controls  is  sometimes 
modeled  as  a  Poisson  input  process.  It  is  reasonable  to  expect  that  the 
rate  of  the  control  actions  is  dependent  on  the  state  of  the  aircraft. 
Another  example  is  the  tracking  of  a  light  source  with  a  photon  detector. 
The  rate  of  photon  arrivals  certainly  depends  on  the  state  of  the  tracking 
system,  notably  the  tracking  error  angle. 

The  most  general  system  considered  in  this  paper  is  described  by  the 
following  scalar  equations: 

dx  *  a  x  dt  ♦  b  dn  (1) 

t  t  t  t  t 


yt 


Vt 


(2) 


where  n^  is  a  marked  Poisson  process  whose  marks  (i.e.,  the  amplitudes  of 
the  jumps)  {u^}  are  a  sequence  of  mutually  independent,  identically 
distributed  random  variables  with  density  py(u). 


The  incident  rate 


-  A  -  •£* 

it  ne  is  a  memorylesa  function  of  the  state,  u(x^).  The  process  b  --- 

■  rownian  motion  with  diffusion  Vfc.  ■  SSr  • 

The  objective  is  to  estimate  xfc  given  the  history  of  the  observation 
irocess,  either  yg  or  zg,  for  s  <  t.  In  Section  2,  an  expression  foe  *  the  . 
nnimium  mean-squared  error  (MMSE)  estimate  is  derived,  and  shown  to '  be 
.mpractical.  Good  suboptimal  approximations  to  the  MMSE  estimate’ ere 
lesirable,  but  are  not  pursued  here.  Instead,  in  Section  3,  the  maxima 
i  posteriori  (MAP)  criterion  is  used  to  derive  a  practical  filter  for 

2.  OPTIMO.  FILTER  EQUATIONS 

This  section  derives  the  expression  for  the  stochastic  partial  differ¬ 
ential  equation  satisfied  by  pt|t(x),  the  conditional  density  function  of 
<t  given  Zt  »  (z^js  *  t},  based  on  a  filtering  theorem  for  white  Gaussian 
observation  noise.  Furthermore,  recursive  equations  are  obtained  for  the 
central  moments  of  this  density  function.  The  procedure  used  here  i» 
similar  to  the  one  used  by  Kwakernaak  (1)  to  analyze  a  linear  time 
invariant  (LTI)  system  driven  by  an  unmarked  Poisson  process  with  a 
constant  rate. 

First,  the  filtering  theorem  stated  in  Kwakernaak  [11  is  summarized 
for  the  special  case  of  a  scalar  system  with  independent  observation  noise. 

Filtering  Theorem  ( 1 ] ;  Let  Qfc,  t  >  tQ,  be  the  semi-martingale  defined 
by 

dQ  “  R  dt  +  dM  t  >  t  (3) 

etc  o 

where  is  a  martingale  with  respect  to  a  growing  family  of  a-fields  F^, 
t  >  tQ,  and  where  Rt  is  a  process  adapted  to  P.  Let  ze#  t  >  tQ,  be  the 
semi-martingale  process 

dz  »  h  dt  +  dw  t  >  t  (4) 

t  t  t  o 

where  h  is  another  process  adapted  to  F,  and  wfc  is  a  Brownian  motion  inde¬ 
pendent  of  F,  such  that  E(dw2)  «  vtdt,  Vt  >  0  for  t  >  tQ.  Define  Zt  as  the 

growing  family  of  e-fields  generated  by  the  process  zt.  For  an  arbitrary 

*  A  *  * 

process  £ define  £^  *  E(5tIZfc) .  Then  satisfies  the  dynamic  equation 

dQt  -  Rtdt  +  [^ht  -  <2tht]\’[dzt  -  htdt]  (5) 
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The  filtering  theorem  will  be  applied  to  Qt  •  i  ,  for  xfc  as  defined  ln 
(1).  However,  the  differential  rule  for  filtered  Poisson  processes  must 
first  be  used  to  obtain  dQt.  The  rule  may  be  found  in  Snyder  [2,  p.  200) 
and  is  also  a  special  case  of  the  differential  rule  for  discontinuoua  aemi- 
martingales  [1,3]. 

Differential  Rule  [2)i  Por  an  appropriately  smooth  function  Q(xt)  and 
for  xt  defined  in  (1),  the  rule  is 

3Q(x  ) 

dQ(xt)  -  atxt( — ^ — )dt  +  /  [Q(xt+btu)  -  Q(xt)]K(dt,du)  (6) 


where  the  last  integral  is  a  counting  integral  [2,  p.  195],  evaluated  over 
the  mark  space  U,  with  respect  to  the  Poisson  counting  measure  K(dt,du). 
A  (At, A)  is  the  number  of  jumps  of  n  during  the  interval  At  with  marks  in 
the  set  A  Q  U. 

Equation  (6)  nu,y  be  put  in  the  form  of  (3)  by  letting 


dM  »  /  [Q(x  +b  u)  -  Q(x  >][K(dt,du)  -  w  <x  )p  (u)dtdu] 

^  M  W  t  t  t  U 


(7) 


and  taking  Rtdt  as  the  remainder.  The  substitution  of  R  into  (5)  yields 


ivx 


d« 


ivx*  ivxfc  ivb  u 


t  /  r  " a.  « 

-  (ivatxte  ♦  e  l[e  -  l]u  (xt>) 


dt 


j  VX  * 

t*  -  ctxtat] 


(8) 


Let  9 1  ■  bfcu  (recall  u  is  the  mark  variable)  and  pg  (' )  be  rhe  probability 
density  function  for  0 If  it  is  assumed  that  t^ie  conditional  density 
function  p.jj(x)  exists,  then  taking  the  inverse  Fourier  transform  of  each 
term  of  (8)  yields 

dpt|t<x)  '  Lpt  1 1  <x )  dt  ♦  -  ctxtdt]  '9> 

where  L  is  the  linear  operator  given  by 

Lp(x)  «  -  [  a^xp  (x)  ]  «■  (p0  (x)  •  [u(x)p(x}])  -  u(x)p(x)  (10) 
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where  ***  denotes  convolution.  As  in  Kwakernaak's  case,  equation  (9)  is 
the  same  as  the  Kushner  equation  for  systems  driven  by  Brownian  motion, 
except  f or  the  definition  of  L. 

Equations  (9)  and  (10)  can  be  used  to  derive  stochastic  differ¬ 
ential  equations  for  xfc  and  the  nth  conditional  central  moments  Pn<  ■ 
E[ (xt-xt)n|Zt]as  follows: 

Pn,t  *  E(  <xt-*t,n,Zt]  n  -  '.2 -  (ID 


dxfc  «  atxfcdt  +  btE(u)u  (xt)dt  +  c^Pj  fc[d*t  ”  ctxtdtl 


n,t  ‘  natPn,tdt  *  J,  (^‘>ktE(u1')(xt-xt)n-Ku(xt)dt  -  nbtE(u)u  (xt)Pn_1>td 


*  V;'cJV,,t  *  nP2.tPn-1.t^dlt  -  Ctxtdtl 


if  Vp  rnz_L  P  P  -  p  1 

t  t  2,tl  2  2,t  n-2,t  n,tJ 


a  ■  2,3,. 


Equations  (11)  and  (12)  represent  an  infinite  set  of  coupled  stochas¬ 
tic  differential  equations.  Thus,  an  exact  mean-squared  error  optimal 
filter  is  impossible  to  implement.  Furthermore,  in  Kwakernaak’s  opinion, 
simple  truncation  of  the  moment  equations  (for  the  constant  rate  case) 
leads  to  unstable  filters  and  generally  poor  results.  Hence,  approximate 
suboptimal  filtering  techniques  are  required,  and  are  under  investigation. 
This  paper  considers  an  alternative  approach  which  uses  a  different  error 
criterion,  and  is  treated  in  the  next  section. 

3.  A  MAP  APPROACH 

For  this  analysis,  it  is  assumed  that  the  driving  process  is  a 

counting  process,  i.e.,  it  has  only  unit  jumps.  Furthermore,  it  is  assumed 

that  the  3yatem  being  driven  is  linear  time-invariant,  that  is,  »  a  and 

bfc  *  b  in  equation  (1).  Thus,  it  is  cleat  that  knowledge  of  the  jump  times 

implies  knowledge  of  xt.  The  approach  followed  in  this  section  is  to 

obtain  MAP  estimates  of  the  number  N^,  of  jumps  in  n  and  the  jump  times 

t  »  rT  .  .  i  on  the  interval  (0,T),  given  the  observations  T-  ■ 

~N_  T  1 ' T  2 .  N  1  T 

T  T 
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{y^is  <  t}  .  The  state  estimate  at  time  T,  denoted  x?,  then  constructed 
by  the  appropriate  superposition  of  lmDulse  responses.  The  approach  iE 
made  into  a  practical  sequential  algorithm  by  using  time  discretization  and 
a  finite  time  window. 

This  is  an  extension  of  the  work  of  Au  and  Haddad  [3]  wherein  the 
approach  outlined  above  was  taken  for  marked  Poisson  driving  processes 
which  have  constant  known  rates. 

The  MAP  estimates  N_  and  t  satisfy 
T  — M 


<W  -  “9  0  <  N*  ' £  M  X.T  I 

\  _ M  T'-M  T  T 


where  the  argument  of  the  logarithm  is  a  joint  a  posteriori  probability 
density  function.  M  is  an  integer  chosen  large  enough  so  that  Pr  ,'NT  >  m) 
is  negligible.  The  condition  N^,  <  M  ensures  that  ^  includes  enough  jump 
times  to  construct  x^. 

The  log  of  the  density  function  in  (13)  can  be  replaced  by  the 
following  expression  without  changing  the  resulti 

tn  /  [2yt  ■  I  h(t,Tf)][  l  h(t,t»)]dt 
■‘to  1-0  j-0  3 

+  ln  PVVVm[^'NT  "  N*'Nt  "  M)  +  tn  Pr[MT  ■  N*|NT  <  •  (M! 

The  first  term  is  recognized  as  the  log  likelihood  function,  wherein 

h(t,T*)  represents  the  response  of  the  system  at  time  t  to  an  impulse  at 

time  T*.  For  brevity,  h(t,r»)  is  defined  as  the  unforced  response  due  to  a 

known  initial  condition  x  . 

o 

The  next  objective  is  to  simplify  the  expressions  of  the  second  and 
third  terms  of  (14).  Note  that  the  event  NT  *  N*  is  also  the  event  k 
T  <  +  Therefore,  the  probability  dt  '  in  the  second  term  can  be 

rewritten  as 

■  n*'nt  4  M)  ■ 

— M  T  T 


pr  IN  <h^|nt  4 
—M  T 

Pr^TN*  <  T-V  +  1  >  T|N*  <  M1 


for  0  k  t  ...  t  tj.  k  T 


and  T  <  TS-  +  1  4  TA 


otherwise 


Since  In  0  -  — • ,  it  is  reasonable  to  restrict  the  region  over  which  the 
expression  in  (13)  is  maximized  to  the  region  of  support  of  (15).  Under 


this  restriction,  the  third  term  of  (14)  cancels  the  denominator  of  the 
nonzero  part  of  (15). 

The  remaining  term  to  simplify  is  the  numerator  of  the  nonzero  part  of 
(IS).  It  is  noted  that  the  evert  <T  <  M  is  also  the  event  t  ^  >  T.  Thus, 
the  term  of  interest  may  be  expressed  as  a  marginal  density  function: 


PT  IN  <M^INT  <  M)  ■  [  Pt  ,|T  .rt^el'Vl  >  T)dTA+1 


T  —M+1  M+1 


It  is  noted  that  the  region  of  support  of  the  integrand  is  over  the  "wedge" 

0  <  t  <  ...  <  t  <  t  minus  the  half  space  r  <  T.  Therefore,  (16) 
i  M  M+ 1  M+ l 


can  be  rewritten  as: 


The  unconditional  density  in  the  integrand  of  (17)  is  a  special  case  of  the 
density  considered  by  Snyder  (2,  p.  248]  for  a  self-exciting  point  process. 
For  this  special  ca-'  the  density  can  be  expressed  as: 


M+1  ,  i 

„  3  r 


PT  («♦,) 

-M+1 


a  -f-  -exp  /  -u(x  (tj_  )]dt  for  0  <  r*  <  ...  <  t*+, 
l»1  l  T*  . 


otherwise 


_  1-1 

b  «XP[  •»(  t— C  j)  ]  Vi  ,(  t  -  T*) 

where  u,  is  the  unit  step  function.  In  words,  7  (tF_j)  is  the  value  of  the 
state  assuming  tnat  n  has  had  jumps  only  at  times  T*,...,T*_..  Let  xj  T^) 
denote  the  unforced  value  of  the  state. 

Substitution  of  (18)  into  (17)  is  straightforward  due  to  the  product 
form  of  (18)  and  yields 

PT  (y)  Max(r*,T) 

Pt.N_<M^|NT  ‘  M)  ”  ir  U~>~  T)  eXp  {.  -“(«*(«)  Idt  ('9’ 

“  i  M  T 


where  it  has  been  assumed  that  there  exists  some  a  >  0  such  that  u  (xl  >  a 
for  all  x,  thus  making 

exp  /  -  ul»  Idt  *  0  • 
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It  is  notsd  that  p  (•)  is  defined  by  replacing  M  *  1  by  M  in  (18). 
Evaluation  of  the  derivatives  yieldsi 


T* 

M  _  1  _ 

n  ut xT*Cj^_1)3eJ‘P  /  ■  W[xt(-L*_i)]dt  Jot  0  *  T*  <  ...  <  T 

i*1  i 


i-I 


M 

(20) 


otherwise 


The  combination  of  equations  (18),  (15),  (19),  and  (20)  results  '.n  a  new 

HAP  equation: 


arq 


Max 

0  <  N*  <  M 


Max 


t*cR. 


T*  <  ...  <  T  2^7  {  [2yt  ' 

LT  <  T*#+1  <  ...  <  T* 

,  \  Max(t*  t) 


+  tn(  n  u[xt.(T._1)]  -  /  u(xt(r^)]dt 


i-1 


(21, 


where  the  maximization  is  to  be  performed  in  two  steps,  first  over  the 
for  fixed  N*,  and  second  over  the  N"s. 

n 


4.  SEQUENTIAL  MAP  APPROXIMATION 

The  MAP  equation  (21)  derived  in  the  previous  section  is  now  approxi¬ 
mated  as  a  sequential  algorithm.  In  this  approximation,  the  observations 
are  processed  in  subintervals  each  of  length  A,  which  is  chosen  such  that 
the  probability  of  having  two  or  more  jumps  in  each  interval  is  negligibly 
small.  Each  subinterval  of  observations  is  used  to  detect  a  jump  in  the 
subinterval  and  to  estimate  the  jump  time,  as  well  as  to  update  the 
estimates  of  past  jump  times. 

In  order  to  reduce  computational  complexity  of  the  algorithm, 
estimates  further  than  L  subintervals  away  from  the  new  subinterval  are  not 
updated  and  considered  'finalized.*  The  selection  of  I,  represents  a 
tradeoff  between  performance  and  complexity.  Thus,  observations  in  the 
kth  subinterval  [(K-1)A,KA)  are  used  to  update  estimates  in  the  'window* 

[ (K-L)A,KA) .  N(n_Lja  represents  the  number  of  finalized  estimates  of  jump 
times. 

Equation  (21)  is  next  modified  so  that  maximization  is  performed  only 
over  jump  times  occurring  after  the  time  (K-L)A.  Any  additive  terms  which 
depend  solely  on  finalized  estimates  are  dropped.  For  brevity,  let  Ny  • 
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N,_  m  !"**  ior  finalized).  Furthermore,  redefine  t  aa  [t~  . ,r~ 

ir-uja  —  k  N  n 

and  redefine  *,.(.££)  aa  the  atate  assuming  that  jumpa  have  occurred  only  at 
the  finalized  tinea  and  at  the  propoaed  times  r*  The  modified  (approxi- 

“’la 

mate)  version  of  (21)  isi 


Max 

("KA'lJ  *  <,r<3  - 

N  <  N*  <  N  +  L 
F  F 


Max 

\  ‘  V  4  "•  *  T5*  <  “ 

“  <  tnVi  <  —  <  th\l 


.  KA  _ 

jv~  /  (2yt  -  x(i£)Ux(T£)ldt 

T  (K-£,)A 


Max(T5  +L«W) 

F 

/  w[xt(l£))dt 

(K-L)A  6  ^ 


(22) 


There  is  a  remaining  difficulty  with  the  maximization  over  the  T*'a  in 
(22).  Assume  that  this  maximization  ia  being  performed  for  a  given,  fixed 
N*.  Furthermore,  assume  a  discretized  domain,  i.e.,  a  aubaet  of  equally 
spaced  discrete  values  in  vf".  The  discretization  implies  that  the  expres¬ 
sion  in  (22)  ia  evaluated  over  a  finite  number  of  values  for  the  t*'a 

between  t~  and  KA ,  but  there  are  still  an  infinite  number  of  values  to 

"p 

check  for  the  t*'s  above  KA.  Maximizing  over  these  'future*  jump  times  is 
equivalent  to  maximizing  the  joint  a  priori  probability  density  for  these 
jump  times. 

The  constant  rate  ca3a  (u[x£!  ■  uq)  presents  no  difficulty,  because 
the  joint  a  priori  density  function  for  the  jump  times  after  KA  has  its 
maximum  at  •  ...  »  r~  +L  «  KA.  It  is  easily  shown  that  the 

same  is  true  for  stable  first  order  systems  and  rate  functions  u  [x]  which 

monotonically  increase  with  Ixl.  However,  for  more  general  LTI  systems  and 
rate  functions,  findino  the  maximum  of  the  a  priori  joint  density  is 

apparently  not  aa  easy.  This  matter  is  currently  under  investigation. 


S.  KXAMPLK 

Figures  1  and  2  display  simulation  results  based  on  the  algorithm  of 
Section  4.  The  parameters  are  (see  equations  1  and  2)  at  ■  -5,  bt  »  2,  and 
ct  ■  1.  The  rate  or  intensity,  u(xt),  of  the  counting  process  n takes 
only  two  values!  u(xt)  «  2  for  I  xt  I  <  1  and  u(xt)  •  4  for  I  xfc  I  >  1. 
Figure  1  contains  the  state  trajectory.  The  rate  takes  its  high  value  when 
the  trajectory  is  above  the  dashed  line  and  the  low  value  otherwise. 
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For  estimation,  4  •  0.03125  sec.  This  yields  an  approximate  upper 
bound  for  Prtr't+i  “  >  1  ]  of  44  •  0.125.  The  observation  noise  samplss 

have  a  standard  deviation  (/V  J  of  0.15.  The  estimation/detection  window 
is  L  •  4.  Estimation  results  are  shown  in  figure  2.  Some  errors  may  be 
observed  at  t  -  2  and  3  <  t  <  4.  It  is  noted  that  for  •  0.1,  all  of 

the  jumps  were  correctly  detected  (to  the  order  of  the  simulation  sampl* 
period)  and  for  /V  •  0.2,  several  more  false  detections  occurred  in  the 
region  0.5  <  t  <  1.5. 


6.  CONCLUSIONS 

The  state  estimation  problem  has  been  considered  for  a  linear  system 
observed  in  additive  white  Gaussian  noise,  where  the  system  is  driven  by  a 
Poisson  process  with  a  state  dependent  rate.  It  is  no  surprise  that  the 
minimum  mean-squared  estimator  is  infinite  dimensional,  since  the  same  is 
true  for  the  simpler  constant  rate  case.  However;  it  is  expected  that  the 
form  of  the  equations  will  suggest  a  good  suboptimal  approximation  in  the 
future.  An  implementable  estimator  was  developed  based  on  maximum 
a  posteriori  (MAP)  estimates  of  the  number  and  times  of  the  jumps  in  the 
driving  process.  However,  the  feasibility  of  this  scheme  has  been  shown 
only  for  certain  LTI  systems  and  rate  functions.  Further  ir.vr'tigttior. 
is  needed  to  enlarge  the  apparently  limited  applicability  of  this  MAP 
approach. 
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Abstract 

A  sequential  detection  scheme  is  used  to  determine  the  approximate 
occurrence  times  of  impulses  in  a  self-excited  point  process  which  drives 
a  scalar  linear  system.  Observations  of  the  state  are  corrupted  by 
additive  white  Gaussian  noise.  The  state  estimate  is  constructed  based 
on  the  detected  impulses. 

1  Introduction 

Linear  systems  driven  by  a  combination  of  a  marked  (randomly  weight¬ 
ed)  impulse  process  and  a  white  Gaussian  noise  process  have  been  used 
as  models  for  maneuvering  targets  [1],  switching  environments  (2j,  and 
seismic  signals  in  oil  exploration  [3j.  Impulsive  input  processes  with 
state- dependent  statistics  are  applicable  if  a  system  is  prone  to  a  high 
disturbance  rate  in  some  regions  of  the  state  space  and  a  low  rate  in 
other  regions.  As  preparation  for  the  analysis  of  the  complex  model  de¬ 
scribed  above,  a  simpler  problem  has  been  addressed  in  which  the  only 
disturbance  is  a  self-excited  point  process  [9]  with  constant  marks.  The 
process  is  described  es  self-excited  because  iu  instantaneous  average 
rate  is  a  function  of  the  state  of  tbe  system  being  driven. 

The  optimal  mean  square  error  filter  involves  an  infinite  set  of 
coupled  stochastic  differential  equations  (4,6).  In  cases  where  tbe  in¬ 
stantaneous  average  rate  of  the  input  process  is  constant  and  is  low 
relative  to  the  bandwidth  of  the  system,  various  truncations  of  the 
optimal  filter  have  performed  poorly  [4,8]. 

These  performance  reports  have  prompted  investigation  into  other 
approaches  [8,6,5]  which  resemble  a  maximum  a  posteriori  (MAP)  ap¬ 
proach.  These  approaches  share  the  basic  goal  of  determining  the  num¬ 
ber,  N,  and  the  arrival  times,  tj/,  of  input  pulses  in  a  time  interval, 
using  observations  over  that  time  interval  and  a  priori  statistics.  Au  [8] 
and  Kwakernaak  [5]  make  the  additional  assumption  of  random  marks. 
However,  it  will  be  demonstrated  that  an  inherent  difficulty  with  thif 
problem  is  preserved  in  the  constant  mark  case.  In  all  approaches  but 
Kwakernaak’*,  the  solutions  to  this  fixed-interval  smoothing  problem 
were  transformed  into  fixed-lag  smoothing  algorithms  by  allowing  the 
interval  to  become  a  moving  window.  A  smoothed  estimate  of  the  state 
is  constructed  by  superimposing  responses  to  the  detected  impulses  as 
they  are  left  behind  by  the  time  window. 

In  order  to  discuss  the  problem  further,  some  definitions  are  needed. 
Let  Noj  be  the  number  of  input  impulses  in  the  interval  (0,7').  Let 
la  =  [n>^.--.,r«]  represent  the  first  n  consecutive  snivel  times  of 
tbe  impulses.  Assume  observations  of  i  scalar  system: 

Vt  -  *t(Uv,,T)  +  «€(0,T)  (1) 

W,.r 

=  £  M*  “»»)  +  •(*, 0)*o  (2) 

<-l 

where  w,  is  Gaussian  white  noise  with  spectral  height  a1  ,  h(t)  is  the 
impulse  response  of  the  system,  end  4(t,0)zo  is  the  unforced  response. 
Let  {y0,r}  repr*«»nt  the  observations  over  the  interval  |0,T).  The 
instantaneous  average  rate  of  input  impulses  is  defined  to  be  p[xt], 
vh-re  it  is  a  positive,  bounded  function. 
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The  difficulty  referred  to  above  is  how  to  properly  use  the  s  priori 
statistics  in  this  problem.  Tbe  procedure  is  straightforward  when  the 
goal  is  to  produce  a  MAP  estimate  of  the  consecutive  arrival  times  in 
an  interval  [0,T),  given  that  there  are  exactly  n  arrival  times.  The 
MAP  estimate  £n  maximizes  the  quantity 

A{t/o,r  I  U>p(l«  I  Not  =  ")  (3) 

where  A  is  the  likelihood  functional 

«*P  ^2^5  J0  l2v*  ~  *<(l»)ll*<(u)]<fcj  (4) 

and  p(zn  |  No,t  ~  n)  **  th*  joint  probability  density  function  (pdf) 
of  the  first  n  occurence  times  in  (0,7*),  given  that  A’o.r  —  r».  It  is 
noted  that  for  the  state-dependent  rate  esse,  this  pdf  is  not  generally 
differentiable  with  respect  to  tn- 

It  is  also  straightforward  to  produce  a  MAP  estimate  of  No,r<  that 
is,  a  minimum  probability  of  error  detection  of  N0t-  The  MAP  esti¬ 
mate  Not  maximizes 

E{Hy<>T  I  !.}  I  Not  =  n>  Pr{N0,r  =  n}.  (5) 

where,  in  the  constant  rate  case,  the  second  factor  is  a  unimodal  func¬ 
tion  of  n,  with  its  peak  at  E{Nqt}  -  AT. 

In  the  problem  at  hand,  however,  neither  tn  or  No,r  alone  will  suf¬ 
fice.  The  two  procedures  must  somehow  be  merged.  Tbe  issuejs  more 
presting  when  the  model  includes  random  marks  which  can  take  very 
■mall  values.  In  that  case,  the  maximum  likelihood  approach  yields 
unreasonably  large  values  of  N  a*  many  small  impulse  responses  are 
made  to  fit  the  observation  noise.  Any  estimator  of  N  must  sufficiently 
penalise  large  values. 

In  each  of  the  three  approaches  mentioned  above,  a  eingle  exp  res 
non  is  maximized  to  determine  both  the  number  and  times  (also  marks 
in  [8]  and  [5])  of  impulses  in  an  interval.  Au  [8]  used  the  likelihood 
ratio  weighted  by  Snyder’s  sample  function  density  (sfd)  [9]: 

/(£»,!!«; ")  =  P(l»  I  Not  =  "Wife  I  N0j  -  n)  Pr {N0j  =  n>  (6) 

where  denotes  the  random  marks.  It  is  noted  that  tbe  sfd  is  not 
a  joint  pdf,  since  the  dimension  of  its  domain  depends  on  tbe  last 
argument.  For  independently  and  identically  distributed  (iid)  marks, 
the  natural  log  of  the  sfd  simplifies  to 

nlnA-AT+£>p(u,).  (7) 

which,  as  a  function  of  n,  does  not  share  the  characteristics  of 
In  Pr{No,r  =  n};  in  particular,  it  does  not  necessarily  penalize  large 
n.  In  the  algorithm,  extreme  values  of  n  art  prevented  by  limiting  tbe 
rate  of  change  in  the  collection  of  n’s. 

In  treating  the  state-dependent  rate  case,  Ingram  and  Haddad  [6] 
replaced  the  sfd  in  Au’s  approach  with  tbe  joint  pdf  of  Im  and  No.t, 
where  M  is  chosen  to  that  Pr{N0,r  >  Af}  <C  1.  Use  of  an  actual 
pdf  might  seem  appropriate  for  a  MAP  approach.  However,  it  was 
observed  that  for  the  constant  rate  case,  this  pdf  is  constant  with 
respect  to  both  tu  and  n  when  n  <  M.  Thus  for  tbe  constant  rats 
c as*,  the  criterion  a  aLupiy  aaaLmna  likelihood  wi*'  an  uppei  Lou^i 
on  n. 
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Kwakernaak  (5]  applied  Riseanen's  [7]  ehorteet  data  description 
method  to  this  problem.  The  resulting  procedure  is  the  same  as  Au's 
except  that  the  expression  in  (6)  is  augmented  with  the  factors  njLjtj.A 
where  and  4  are  resolutions  for  digitizing  the  ith  mark  and  arrival 
time,  respective!)'.  The  resolutions  are  chosen  to  minimize  the  aug¬ 
mented  expression,  which  is  interpreted  as  the  symbol  length  needed 
to  encode  the  data  n,  Zn,  and  This  method  is  optimal  for  the  data 
length  criterion  and  penalises  high  values  of  n.  However,  it  is  not  read¬ 
ily  applicable  when  the  input  is  a  self-excited  point  process.  This  is  due 
.  to  the  complexity  of  the  expressions  and  the  required  differentiability 
of  the  sfd  in  the  assignments  of  rj,-  and 

2  The  Sequential  Algorithm 

The  approach  taken  in  this  paper  to  the  problem  of  estimating  the  self- 
excited  input  point  process  has  two  steps.  The  first  step  is  to  compute 
the  MAP  estimates  L*  for  every  n  such  that  0  <  n  <  Af,  where  Af  is 
chosen  as  above.  The  likelihood  functional  for  the  n  =  0  case  is 
computed. 

The  second  step  is  to  approximate  the  minimum  probability  of  error 
detection  of  Nqj  by  replacing  the  averaged  likelihood  functional  in  (5) 
with  the  likelihood  functional  evaluated  at  £ i.e.,  by  maximizing 

Hl/of  |£«.*o}Pr{lVo,r  =  "  I  *o}  (8) 

over  0  <  n  <  Af  to  get  IV’o.T-  The  final  estimate  of  input  impulae  times 
>*  t/ftT  ■  Both  terms  in  (8)  are  conditioned  on  zo  because  of  the  state 
dependence  of  the  rate. 

The  main  reason  this  approach  was  chosen  is  its  explicit  depen¬ 
dence  in  both  stops  on  s  priori  statistics.  This  dependence  was  highly 
desirable,  given  the  rather  elaborate  input  model  that  has  been  as¬ 
sumed.  The  reason  why  equation  (5)  was  not  used  is  mainly  due  to 
implementation  difficulties.  Specifically,  the  likelihood  functional  can 
take  on  very  large  values  for  high  signal-to-noise;  this  causes  high  sen¬ 
sitivity  to  approximation  errors  in  the  numerical  integration  needed  to 
perform  the  expectation.  Even  if  this  sensitivity  problem  did  not  exist, 
tbe  multiple  integration  would  not  be  desirable  because  it  is  very  time 
consuming. 

Equation  (8)  is  relatively  easy  to  implement.  The  first  factor  is  a 
byproduct  of  tbe  first  step  and  the  second  factor,  Pt{No,t  —  »  I  *o}, 
can  be  computed  off-line  for  the  desired  range  of  values  for  Xq.  The 
second  factor  naturally  imposes  a  penalty  on  high  n  and  also  noticeably 
dependa  on  so- 

In  the  sequential  algorithm,  this  procedure  is  performed  over  the 
interval  [A,  A+T],  where  A  is  periodically  incremented.  The  smoothed 
state  estimate  i*  takes  the  place  of  zo. 

The  elgorithm  has  been  tested  on  the  following  example.  Let 

MO  =  2e-“u(t) 

M[x,l  *  /  1  1**1  5  I 

Some  values  of  PrfN^j-  =  n  j  z<j},  computed  for  the  example  and 
T  *  0.1875  are  shown  in  the  table.  For  this  interval  size,  Pr{  jVo.r  > 
8}  <  0.0073,  so  it  is  sufficient  to  consider  values  of  n  up  to  Af  =  3.  It 
is  observed  that  as  zo  increases,  the  probability  weight  gradually  shifts 
away  from  n  *  0,  but  the  probabilities  of  n  =  1  and  n  =  3  still  differ 
by  ea  order  of  magnitude.  For  zo  >  2.8,  tbe  distribution  for  0  <  n  <  3 
is  unchanged  because  the  high  initial  condition  ensures  that  m(>i)  =  4 
over  the  whole  interval.  For  o1  =  0.01 ,  all  pulse*  except  the  first  are 
detected  within  the  time  resolution  of  the  eimulation.  As  the  noise 
strength  grows  from  9*  =  0.04,  errors  begin  to  occur.  Additional  tests 
are  being  performed  and  e  theoretical  performance  analysis  is  under 
investigation. 


*0 

n  3=  0 

n  =2  1 

n  =  2 

n  =  3 

0.0 

0.829 

0.121 

0.039 

0.010 

0.2 

0.829 

0.120 

0.040 

0.010 

0.4 

0.829 

0.119 

0.04C 

0.010 

0.6 

0.829 

0.119 

0.040 

0.010 

0.8 

0.829 

0.119 

0.040 

0.010 

1.0 

0.829 

0.119 

0.040 

0.010 

1.2 

0.744 

0.159 

0.075 

0.022 

1.4 

0.679 

0.195 

0.094 

0.027 

1.6 

0.623 

0.231 

0.109 

0.030 

1.8 

0.581 

0.259 

0.119 

0.032 

2.0 

0.547 

0.287 

C.126 

0.033 

2.2 

0.517 

0.309 

0.130 

0.033 

2.4 

0.491 

0.338 

0.133 

0.033 

2.6 

0.472 

0.354 

0.133 

0.033 

Table  1:  Values  of  Pr{  Vqj  =  n  |  Zq}  for  the  example  system. 
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ABSTRACT 

In  this  work,  mean-square  continuity  is  proved  for  the  state  of  a  linear  system  disturbed  by 
a  point  process  with  a  state-dependent  rate  and  random  marks.  The  cross-correlation  property 
and  the  linear  optimal  filter  are  derived  for  the  case  of  zero  mean  marks. 

SUMMARY 

The  model  treated  in  this  summary  is  a  continuous  linear  time  invariant  system  driven  by 
a  self-excited,  marked  point  process.  The  term  “self-excited*  implies  that  the  instantaneous 
average  jump  rate  or  intensity  of  the  point  process  depends  on  the  history  of  the  process.  Thus, 
self-excitation  is  one  kind  of  time-correlation.  In  particular,  the  jump  rate  is  specified  as  a 
memoryless  function  of  the  system  state.  The  term  ‘marked*  describes  a  point  process  with 
random  jump  amplitudes  (marks). 

One  possible  application  of  this  model  is  in  the  tracking  of  maneuvering  targets.  The 
jump  process  represents  the  commanded  acceleration  of  the  vehicle  being  tracked.  The  state- 
dependency  of  the  rate  represents  a  relation  between  the  rate  of  acceleration  jumps  and  the 
position  and  velocity  of  the  vehicle.  Another  possible  application  is  in  state  estimation  for 
systems  subject  to  abrupt  failures,  such  as  the  onset  of  biases  in  sensora  or  actuators.  Here, 
state-dependency  of  the  rate  may  represent  an  increased  vulnerability  to  failures  under  condi¬ 
tions  of  high  heat,  speed,  or  electrical  current. 

The  state  process  is  given  by  the  following  stochastic  differential  equation 

dx i  =  Axtdl  +  BdMt  t  >  0 

where  r«  €  31"  is  the  state,  with  initial  condition  *o,  A  €  R"  x  SR",  B  6  and  Mi  6  31  is  a 
piecewise  constant  random  process  to  be  defined  below. 

Let  A,  denote  the  number  of  jumps  in  Mi.  Let  {u;,  uj, . . .  ,  u*,}  be  the  consecutive  jump 
nr  maxltj  of  Mi.  Then  Mi  may  be  expressed 

S, 

Mt  =  u,-. 

iml 

The  marks  are  assumed  to  be  independently  and  identically  distributed  with  mean  u  and 
mean  square  value  u1.  We  define  the  stochastic  intensity  (1|  or  instantaneous  average  rate  of 
A|  to  be  p[zi|,  where  p  is  a  scalar  valued,  positive,  and  bounded  function  of  the  state  z(  of  the 
system.  It  follows  that  A(  -  /0‘  n[x,]ds  is  a  martingale  with  respect  to  the  <r-algebra  Si  generated 
by  xt.  The  observation  model  is  given  by 

dzt  =  ztdt  +  dw  i 

where  wt  is  an  n-vector  of  Wiener  processes  with  E{dwtdwi}  =  Idt. 

There  has  been  a  fair  amount  of  work  concerning  systems  driven  by  compound  Poisson 
processes,  that  is,  independent  increment  point  processes  with  random  marks.  The  contribu- 
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tions  include  representations  end  properties  (2),  and  mean-aquar*  optima],  linear  optimal,  and 
auboptimal  atate  eatimation  [3,4] .  Self-excited  and  more  general  point  proceaaea  have  received 
attention  [2,1],  mainly  as  models  for  point  process  observations  of  dynamic  systems. 

Martingale  theory  has  been  applied  successfully  in  the  characterization  of  point  processes 
[  1  ]  as  well  as  in  nonlinear  filtering  theory.  Thus,  it  was  desirable  and  instructive  to  use  it  in 
proving  the  following  propositions. 

Proposition  1  If  there  exists  a  constant  K  such  that 

p(i]  <  K  <  +oo 

for  all  x  €  R",  then  xt  «»  mean-ajuare  continuous,  that  it, 

lim  £{||i<  -  r.H1}  =  0. 

where  ||  ■  ||  denote t  the  Euclidean  distance  in  R". 

Proposition  2  If  the  mean  oj  the  marks  is  zero,  i.e.,  if  u  =  0,  then  i,  is  of  the  separable  class 
|5],  which  implies  that  for  any  nonlinear,  scalar  valued  function  j(-),  there  exists  a  constant 
vector  C  such  that 

E{ztg{x,))  =  E{x,x',)C. 
given  that  the  appropriate  expectations  exist. 

The  innovations  approach  of  Kailath  |6j  may  be  used  to  derive  the  linear  filter.  The  mean- 
square  continuity  of  i,  is  sufficient  to  prove  that  the  innovation!  process  v,  =  z(  -  /('o  x,  da, 
where  x,  is  the  filter  output,  is  a  process  of  orthogonal  increments.  When  the  marks  have  zero 
mean,  the  resulting  filter  is 

dit  =  Ai,dt  +  Pfdv  t 
P,  =  AP,  +  P,A'  +  BB'u*  ^  -  P,P, 

Although  Pt  will  be  difficult  to  compute  due  to  the  a  priori  expectation  jujzij,  ‘b*  computation 
can  be  done  off-line.  We  are  currently  investigating  methods  of  computation  and  the  filter 
expression  for  the  case  of  nonzero  mean  marks. 
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ABSTR  4CT 

Modeling  issues  and  the  minimum  mean  squared  error  linear  filter  and 
smoother  are  studied  for  a  linear  system  disturbed  by  a  jump  process  with 
a  state-dependent  rate  and  random  jump  heights.  The  jump  process  is 
defined  in  terms  of  martingale  processes.  Martingale  techniques  are  used 
to  derive  certain  properties  and  second  order  statistics  of  the  jump  and  state 
processes.  It  is  shown  that  the  linear  filter  and  smoother  are  practical  only 
for  the  case  of  zero-mean  jump  heights. 

1  Introduction 

Linear  systems  with  random  impulsive  forcing  functions  have  been  used  to 
model  dynamic  systems  subject  to  abrupt  failures  or  bias  changes  [l]  and 
manuevering  targets  [2],  as  well  as  many  other  physical  situations  (3,  chapt. 
4j.  State  estimation  for  such  systems  from  noisy  observations  has  been  an 
active  research  area  for  many  years.  The  input  process  is  often  described 
as  either  an  independent  increment  compound  Poisson  process  [4, 5, 6, 7]  or 
a  discrete-time  semi-Markov  process  [2,8 j.  In  both  cases,  the  minimum 
mean  squared  error  (MMSE)  filter  is  not  implementable.  Thus  researchers 
have  considered  various  approximations  to  the  MMSE  filter  as  well  as  linear 
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optimal  filters  and  schemes  which  involve  maximum  a  posteriori  (MAP)  or 
MAP-like  criteria  [6,8]. 

This  note  treats  a  linear  system  driven  by  an  extended  version  of  the 
compound  Poisson  process.  The  extension  results  from  allowing  the  instan¬ 
taneous  average  rate  of  the  input  impulses  to  depend  on  the  state  of  the 
linear  system.  In  the  abrupt  failure  application,  the  state  dependency  is 
motivated  by  the  idea  that  a  system  maybe  more  prone  to  failures  when 
it  is  under  some  degree  of  “stress,”  as  defined  by  a  region  of  the  state 
space.  In  the  target  tracking  application,  the  likelihood  of  a  pilot  to  give 
an  acceleration  command  may  depend  on  his  speed  and  position,  again  re¬ 
flected  by  the  state  of  the  system.  Apparently,  this  input  model  has  not 
been  previously  considered  in  the  context  of  state  estimation.  A  related 
model,  a  linear  system  with  Markov  jump  parameters  where  the  jump  rate 
is  state-dependent,  has  been  considered  for  optimal  control  [9]. 

The  objectives  of  this  note  are  to  determine  various  properties  and  quan¬ 
tities  of  the  process  of  interest  which  are  relevant  to  MMSE  estimation, 
and  to  derive  the  MMSE  linear  filter  and  fixed-lag  smoother.  The  note 
is  summarized  as  follows.  Section  2  contains  a  methodical  development  of 
the  properties  of  the  state  process.  The  development  is  based  on  a  semi¬ 
martingale  representation  of  the  counting  process  which  underlies  the  jump 
process.  It  is  this  semimartingale  representation  which  precisely  describes 
the  state-dependency  of  the  system  disturbance.  We  build  up  from  the 
counting  process  to  the  jump  process,  and  finally  to  several  representations 
of  the  state  process.  The  state  process  is  proved  to  be  square  integrable 
and  mean  square  continuous.  Section  3  contains  a  discussion  on  linear  esti¬ 
mators  for  the  state  process,  given  observations  in  additive  white  Gaussian 
noise.  A  recursive  form  for  the  filter  and  fixed-lag  smoother  follow  easily 
when  the  jump  heights  have  zero  mean.  In  the  general  case,  however,  a  re¬ 
cursive  form  of  the  linear  filter  is  not  obtained.  Some  observations  are  made 
from  the  general  filter  expression  and  the  form  of  the  a  priori  covariance 
equations. 
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2  The  State  Process 

In  this  section  we  define  the  state  process  and  discuss  its  properties.  Several 
representations  are  considered,  and  the  properties  of  square  integrability 
and  mean  square  continuity  are  proved.  We  recall  that  these  properties 
are  true  for  the  constant  rate  case,  so  it  should  not  be  surprising  that  they 
follow  for  the  state-dependent  rate  case  when  the  rate  function  is  uniformly 
bounded.  The  definitions  and  proofs,  however,  do  require  some  care. 

The  state  process  is  given  by  the  following  stochastic  differential  equa¬ 
tion 

dxt  =  Axtdt  +  BdMt  t  >  0  (1) 

where  xt  is  an  n-dimensional  state  vector,  with  initial  condition  xq.  The 
nxn  constant  matrix  A  is  assumed  to  be  such  that  the  solution  of  i  =  Ax  is 
exponentially  stable.  The  scalar  process  M(  is  a  piecewise  constant  random 
process.  The  jumps  of  Mt  occur  at  an  instantaneous  average  rate  which 
depends  on  xt. 


2.1  The  Input  Process 


The  process  M,  is  known  as  a  jump  process.  Its  definition  depends  on  a 
sequence  of  ordered  pairs  {(ri,  Ui),  (r2,u2), ...},  where  r,  >  0  is  the  time  of 
the  ith  jump  and  ut  is  the  jump  height.  This  sequence  of  ordered  pairs  is 
known  as  a  point  process,  and  the  u’s  are  the  marks  of  the  point  process. 
The  t  sequence  may  be  equivalently  represented  by  the  counting  process 
Nt,  which  is  the  number  of  jumps  prior  to  time  t.  Thus  the  jump  process 
Mt  may  be  expressed 
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In  this  note,  the  marks  axe  assumed  to  be  independent  and  identically 
distributed  (iid)  with  probability  density  function  (pdf)  p„(u),  mean  u,  and 
mean  square  value  u:.  Also,  u,  is  independent  of  {N,,M,;0  <  s  <  t}  for 
j  >  Nt. 

The  state  dependency  of  the  rate  is  made  precise  in  the  definition  of 
the  counting  process  Nt.  Because  martingale  theory  is  known  to  be  quite 
powerful  in  the  analysis  of  point  processes  on  the  real  line  [10, ll]  and  it  is 
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fundamental  to  nonlinear  filtering  theory  [12],  Nt  v’ill  be  defined  in  terms 
of  martingales. 

Let  St  be  the  smallest  a-algebra  containing  the  histories  of  both  the 
state  xt  and  the  counting  process  Nt,  i.e.  St  —  cr{xs,N,-,s  <  t}.  The 
reason  Nt  is  explicit  in  the  definition  of  St  is  because  if  a  mark  u  can  take 
the  value  of  zero,  then  a  jump  in  Nt  may  not  coincide  with  a  jump  in  xt. 
St  is  assumed  to  possess  the  “usual”  properties  of  completeness  and  right 
continuity  [13].  Consider  the  process  At  =  n  :  Zn  Z  +  ,  where  /x 

is  such  that  0  <  n[x\  <  K  <  +oo  for  all  x  6  Zn  and  some  constant  K. 
We  specify  Xt  to  be  a  stochastic  intensity  with  respect  to  St,  or  simply  an 
5r intensity,  for  Nt  [11,  p.  27].  Note  that  the  definition  of  5(-stochastic 
intensity  requires  that  Nt  be  measurable  with  respect  to  St,  hence  the  need 
for  Nt  in  the  definition  of  St ■  An  informal  interpretation  of  \t  is  that  on 
the  infinitesimal  interval  \t,t  +  dt),  Nt  acts  like  a  Poisson  process  with  rate 
parameter  \t,  or  that  Pr{dlVt  =  1  ]  St}  =  A tdt. 

By  definition  of  St-stochastic  intensity, 

Dt=Nt  -  f*  X,ds  (3) 

Jo 

is  an  St-martingale.  The  integral  /0‘  A ,ds  is  also  known  as  the  unique  pre¬ 
dictable  compensator  for  Nt  with  respect  to  S(  [10,  p.  59]. 

It  is  known  that  a  counting  process  Nt  may  have  more  than  one  stochas¬ 
tic  intensity  with  respect  to  a  given  growing  a-algebra,  but  there  is  only 
one  stochastic  intensity  which  is  predictable  [11,  p.  30].  If  n\xt\  has  left 
and  right  limits,  then  A<  =  /z[z<_]  is  predictable  [14,  p.  46],  However,  pre¬ 
dictability  is  not  required  for  the  results  in  this  paper;  any  Xt  differing  from 
A{  on  a  set  of  Lebesgue  measure  zero  may  be  used. 

The  description  of  a  point  process  using  a  stochastic  intensity  is  a  rel¬ 
atively  modern  approach  to  the  modeling  of  point  processes.  One  of  the 
classic  approaches  is  to  define  a  point  process  by  the  joint  probability  dis¬ 
tribution  of  its  jump  times  and  marks  and  by  the  distribution  of  Nt.  In 
his  book  [3],  Snyder  reviews  distributional  descriptions  for  many  classes 
of  point  processes  on  the  real  line.  One  class  of  processes,  the  class  of 
“marked  self-excited  point  processes,”  includes  the  process  of  interest  in 
this  note  [3,  p.467].  The  term  self-excited  means  that  the  present  and  fu¬ 
ture  statistics  of  the  process  depend  on  its  own  past.  It  is  noted  that  the 
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specification  of  A(  =  p[xt\  as  a  stochastic  intensity  of  Nt  is  consistent  with 
the  distributional  characterization  of  a  marked  self-excited  point  process 
where  the  self-excitation  is  through  state  dependency.  The  connection  is 
made  through  a  theorem  stated  by  Brema”d  [  1 1 ,  p.  61].  We  note  in  passing 
that  the  distributional  description  is  useful  in  certain  maximum  a  posteriori 
(MAP)  approaches  to  this  state  estimation  problem  [15,16] . 

The  definition  of  Nt  will  now  be  generalized  to  include  random  jump 
heights  or  marks.  This  procedure  will  lead  to  a  decomposition  of  Mt  similar 
to  the  decomposition  in  equation  (3).  The  decompositon  of  Mt  will,  in  turn, 
lead  to  a  useful  decomposition  of  xt. 

Let  the  mark  sequence  {un,n  >  1}  take  its  values  in  the  measure  space 
( U,U ).  The  idea  of  the  counting  process  Nt  may  be  generalized  to  a  counting 
measure  p((0,  f]  x  A),  which  is  the  number  of  jumps  in  Mt  that  have  marks 
(or  jump  heights)  in  the  set  A  £  U  [11,  p.  234].  It  follows  that  Nt  = 
p((0,f]  x  U )  and 

Mx  =  J  up((0,  t\  x  du).  (4) 

If  Nt  has  the  stochastic  intensity  A*  and  the  future  marks  are  iid  and  inde¬ 
pendent  of  St,  then  it  follows  that  the  stochastic  intensity  of  p((0,t]  x  A), 
is  AtPr{A).  A  heuristic  argument  is  given  below. 

Pr {p{dt  x  A)  =  1  |  St}  =  Pr {dMt  £  A  |  dNt  =  1,  St}  Pr{dNt  =  1  |  St} 

=  Pr{u;v1+1  £  A  |  dNt  =  l}Atdf 

=  /  pu(u)duXtdt. 

J  A 

A  corollary  of  Bremaud  [11,  p.  235]  then  implies  that 
R:  =  j  J  tl  [p(ds  x  du)  —  X, pu(u)dsdu) 

—  Mt  -  [  uX3ds  (5) 

Jo 

is  an  Sj-naartingale  and  Qt  =  /d  uX,ds  is  the  unique  predictable  compen¬ 
sator  of  Mt.  It  is  noted  that  (5)  is  also  the  unique  decomposition  with 
respect  to  St  =  a{x,\0  <  s  <  t}.  This  is  because  dMt  is  conditionally 
independent  of  {IV,; 0  <  s  <  t}  given  xt.  It  is  further  noted  that  Rt  is  an 
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orthogonal  increment  process  (its  formal  derivative  is  white  noise)  since  all 
martingales  have  orthogonal  increments. 

In  the  appendix,  Rt  is  shown  to  be  an  L2-martingale  with  quadratic 
variance 

(R,R)t  =  [  u2n[xa\ds  (6) 

Jo 

which  implies  that  R\  —  (R,R)t  is  an  Srmartingale  [17,  p.  1 15 J .  It  follows 
that 


E{W  =  £{(£,*>,„} 

rt  At _ 

—  u2  n[xa}ds.  (7) 

Jo 

where  t  A  r  =  min(t,r).  The  expressions  in  (6)  and  (7)  will  be  employed 
below  in  a  bound  on  state  variance  and  in  the  linear  filter  expression,  re¬ 
spectively. 
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2.2  Representations  and  Properties 

The  state  process  has  several  representations.  These  representations  will 
be  reviewed  and  then  one  will  be  used  to  prove  the  square  integrability  and 
mean  square  continuity  of  xt. 

The  expressions  in  (l),  (2),  and  (3)  constitute  one  representation  for  xt. 
It  is  also  possible  to  give  an  augmented  state  equation  with  an  independent- 
increment  excitation.  Let  Mt  be  a  compound  Poisson  process  with  iid 
pu(u)-distributed  marks  and  a  unity  jump  rate.  It  follows  from  a  theorem 
and  lemma  of  Bremaud  [11,  p.  41],  that  if  At  is  uniformly  lower  bounded 
away  from  zero,  then  xt  may  also  be  represented  by  the  following  equations: 

dxt  =  Ax t  +  BdMr,  ,  . 

dr  i  =  ii[xt\dt  ' 

where  the  augmented  variable  performs  time  scaling  of  the  input  process 
Mt.  While  the  representation  in  (8)  has  not  been  useful  in  analysis,  it  is 
useful  for  computer  simulations. 

It  is  easily  observed  that  xt  is  a  Markov  process  with  a  stationary 
transition  function,  since  the  statistics  of  dMt  in  equation  (l)  are  com¬ 
pletely  determined  by  xt.  More  specifically,  xt  is  in  the  class  of  Piecewise- 
deterministic  Markov  processes,  a  class  described  by  Davis  [18]  that  “covers 
virtually  all  non-diffusion  applications.” 

When  a  Markov  process  has  a  stationary  transition  function  and  is  con- 
tinous  in  probability  (which  is  implied  by  mean  square  continuity,  proved 
below),  then  its  transition  function  is  uniquely  determined  by  its  differential 
generator  A  [19,  p.  184].  The  operator  A  can  be  used  to  derive  differential 
equations  which  propagate  the  covariance  of  xt  as  well  as  other  expected 
values  associated  with  xt.  Given  a  continuously  differentiable  function  /, 
an  expression  for  Af(xt)  may  be  found  by  simplifying  the  general  formula 
in  Davis  [18]: 

Af(xt)  =  +  n\xt\  /( xt  +  Bu)pu(u)  du  -  /(x,)J  .  (9) 

A  property  of  the  generator  is  that 

f{xt)  ~  /(*,)  -  J'  Af{xr)  dr  (10) 
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is  a  martingale  for  t  >  s  [18]. 

The  adjoint  operator  Am  yields  the  following  evolution  equation  for  the 
pdf  of  xt ,  assuming  the  density  exists. 


dpt(x) 

dt 


A'pt{x) 


d_ 

dx 


\-Axpt{x)\  +  /  pt{x  -  y)p\x  -  y}pBu{y )  dy  -  pt(x)n\x]  (11) 
J  kn 


where  Pbu(')  is  the  pdf  for  the  random  vector  Bu.  It  is  observed  that  A *  is 
not  a  local  operator  because  of  the  shifts  in  the  convolution  integral.  The 
shifts  also  imply  that  the  steady-state  equation  is  classified  as  a  differential 
delay  equation. 

Taking  /  in  equation  (10)  to  be  the  identity  yields  the  Srsemimartingale 
decompositon  of  xt,  which  is  useful  for  deriving  the  MMSE  nonlinear  filter. 
This  decomposition  can  also  be  deduced  directly  from  (l)  and  (5)  and  is 
given  by 


Xt  —  xq  +  Gt  +  Ht  (12) 

Gt  =  Ax,  +  Bup[xs]  ds 

Ht  =  f  BdR,. 

Jo 


where  Ht  is  the  martingale  by  an  important  property  of  stochastic  integrals 
[12]  and  Gt  is  predictable  since  it  is  continuous. 

The  final  representation  to  be  considered  is  a  decomposition  of  the  su¬ 
perposition  integral.  Let  $(t)  be  the  state  transition  matrix  corresponding 
to  the  plant  matrix  A.  Then  xt  can  be  represented  as 


$(f)x0+  [t^{t-s)BdM, 

Jo 

(13) 

$(t)x0  +  *(t)Vi  +  $(t)Jt 

(14) 

[  $(-s)Bti/i[i,]ds 

Jo 

(15) 

[t${-s)BdR, 

Jo 

(16) 

Here  $(t)  is  factored  out  of  the  integrals  to  enable  the  semimartingale 
decomposition  involving  It  and  Vt.  This  representation  rather  than  (12) 
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is  used  to  prove  square-integrability  and  mean-square-continuity  of  xt  be¬ 
cause  the  integrand  of  Vt  is  a  uniformly  bounded  function  of  x,,  while  the 
integrand  of  Gt  is  not.  These  properties  are  proved  below. 

The  process  xt  is  square  integrable  iff  [17] 

E{  M2}<+oo. 

By  the  triangle  and  Cauchy-Schwarz  inequalities, 

+  2[(E{A*}E{B,})11  +  (£{A!}£{C!»*  +  (£{BJ}£{C!})(]  (17) 

where 

A  =  ||*(t)x0|| 

B  =  ||*(t)Vt|| 

C  =  ||*W/.II 

and  where  ||x||  is  the  Euclidean  norm  for  x  6  Zn  and  ||$||  is  the  matrix  norm 
induced  by  ||x||.  The  equation  x{  =  Axt  is  assumed  to  be  exponentially 
stable,  which  implies  that  there  exist  positive  constants  7  and  r)  such  that 
||4(0I|-  <  ie~nt  [20].  Also,  for  random  x0,  we  assume  £{|]x0||2}  <  +00  and 
recall  that  n\x\  <  K  for  all  x.  Hence,  the  following  inequalities  are  implied: 

£{||*(«)*oll>}  <  ||*(I)||!£{|M!}  (18) 

£{||*(!)V,|r} 

=  E  jy"  j  u2 n\x,]n[xT}BT$(t  -  s)r$(f  —  r)Bdsdr | 

<  r  /*  u2K 2  -  s)T$ (t  -  t)B\  dsdr 

Jo  Jo  '  1 

<  u*k2\\b\\2  fm-s)\\ds  f'm-T)\\dT  (19) 

Jo  Jo 

One  of  the  properties  of  an  L2-martingale  Rt  is  that  if  C,  is  a  bounded 
predictable  process,  then  the  stochastic  integral  rpt  =  /0‘  C,dR ,  is  again  an 
L2-martingale  with  predictable  variation 

«M«,  =  fc]i(R,R)„ 

Jo 
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and  E{tp {2}  =  E{{tp,x{))t}  [17].  By  a  straightforward  generalization  to  the 
vector  case  of  (16),  it  follows  that 

=  Trace  E  $(£  —  s)Bu2fi\xa]BT$(t  —  s)Tds| 

<  E  [JQ  «2mMII$(*  -s)||2||B||2(is} 

<  rfK\\B\\2  ['{^{t-s^ds  (20) 

Jo 

Because  x  =  Ax  is  exponentially  stable,  all  of  the  integrals  of  $(£  —  s)  are 
bounded  by  a  constant  [20].  Thus  the  bound  on  iE7{ jj jct || 2}  does  not  depend 
on  time,  and  xt  is  square  integrable. 

Next  xt  is  shown  to  be  mean  square  continuous.  This  property  justifies 
the  use  of  innovations  in  deriving  the  MMSE  linear  estimators.  In  order  to 
show  mean  square  continuity  we  must  show 

lim£{||xt  -  xw |)2}  =  0. 

Let  t v  <  t.  The  representation  in  equation  (13)  may  be  used  to  write 

xt  —  xw  —  [$(t  —  to)  —  I\xv  +  f  $(£  -  s)BdMa. 

Jw 

Steps  parallel  to  (14)  and  (17)  yield  the  inequality: 

lilt  -  Zw\\2  <  H$(t  -  w)  -  JflM2  +  ||$ (t  -  ti>)V’«.i||2  +  ||$(t  -  w)IW't\\2 


where  Vw  t  and  Iw  t  are  defined  the  same  as  in  (15)  and  (16),  respectively, 
except  the  lower  integration  limit  of  0  is  replaced  by  w.  Continuation  of  the 
same  procedures  yields  inequalities  nearly  identical  to  (19)  and  (20),  dif¬ 
fering  only  in  the  lower  integration  limit.  Further  simplification  is  possible 
by  using  the  inequalities  below  [20].  Let  a  =  ||.A||. 


||*(*  -  «p)  -  J||  < 

< 

f  ll$(*  ~  5)llds  < 

J  w 

fm-s)\\'ds  < 

Jw 


aJ0  ll*(«)ll<& 
?f[l  -  )] 

2  [i_e-M«-H] 
£  [l  -  e-2A(f_“')] 
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Substitution  of  these  inequalities  leads  to  the  expression 
£{||*.  -  x.||2}  <  ^£{||x.f }  [l  - 

+  u’K'WBW2^  [l  -  [l  -  e-2A<‘-“'>] 

The  boundedness  of  £■{ || ]| 2}  thus  implies  E{ ||x(  —  x,J|2}  — ►  0  as  t 
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3  The  Linear  Filter  and  Smoother 


For  the  estimation  problem,  we  assume  the  m-dimensional  observation  pro¬ 
cess  to  have  the  following  form: 

dyt  =  Cxtdt  -t-  dvt  (21) 

where  the  matrix  C  is  such  that  A  and  C  yield  a  completely  observable 
system.  The  observation  noise  Vt  is  an  m-dimensional  Wiener  vector,  inde¬ 
pendent  of  xt,  with  E{vtvJ }  =  ^,-dT. 

Because  xt  is  mean  square  continuous,  it  belongs  to  the  Hilbert  space 
spanned  by  all  mean  square  continuous  random  processes.  Thus  its  opti¬ 
mal  linear  filter  exists  as  the  projection  of  xt  onto  the  growing  subspace 
generated  by  the  observation  process  yt.  But  the  practicality  of  the  filter  is 
not  guaranteed.  An  interesting  characteristic  of  the  process  xt  described  in 
Section  2.2  is  that  it  appears  to  be  on  the  “borderline”  of  the  set  of  pro¬ 
cesses  for  which  recursive,  finite-dimensional  filters  exist.  This  is  because 
in  the  case  of  zero  mean  marks,  the  linear  filter  expression  is  simple  and 
familiar,  whereas  for  the  case  of  non-zero  mean  marks,  a  recursive,  finite¬ 
dimensional  filter  does  not  seem  to  be  possible.  In  this  section,  we  give  the 
linear  filter  and  fixed-lag  smoother  for  the  case  of  zero  mean  marks,  and 
discuss  the  difficulties  associated  with  the  case  of  non-zero  mean  marks. 

3.1  Zero-Mean  Marks 

When  the  marks  have  zero  mean,  i.e.  when  u  =  0,  the  input  jump  process 
Mt  is  a  martingale  and  hence  has  orthogonal  increments.  The  filtering  prob¬ 
lem  is  classified  by  Kailath  [21]  as  the  Stratonovich-Kalman-Bucy  (SKB) 
problem,  for  which  the  filter  equations  are  well  known.  Let  xt  be  the  op¬ 
timal  linear  estimate  of  zt,  z,  =  xt  -  xt,  and  P(t)  =  E{xtxf}.  The  linear 
filter  equations  are: 

dxt  =  Axtdt  + P{t)CTV;ldvt 

P(t)  =  AP{t)  +  P{t)AT -P(t)CT*;1CP{t)  +  Bu*n[xt]BT.  (22) 

where  ut  =  y(  —  /0‘Cz,ds  is  the  innovations  process.  The  only  unusual 
characteristic  in  these  equations  is  the  a  priori  expectation  p[xt  =  E{p[xt\}. 


Recall  that  the  rate  function  p,  is  necessarily  nonlinear,  since  it  must  be 
positive.  We  have  found  that  for  a  scalar  system  and  the  simple  function 


H\x\ 


kx  |x|  <  a 
fc2  |x|  >  a 


for  some  a  >  0,  n\xt)  can  be  well  approximated  by  numerically  propagating 
the  pdf  according  to  equation  (ll)  and  computing  E{iJ.[xt}}.  We  also  note 
that  for  scalar  systems  with  certain  mark  distributions,  it  is  possible  to 
derive  the  steady  state  pdf. 

The  fixed-lag  smoother  for  the  zero  mean  mark  case  can  be  derived  using 
Kailath’s  procedure  [23],  except  that  martingale  properties  are  invoked  in 
the  computation  of  the  error  covariance  P(s,  t)  =  E{x,x?},  and  differentials 
are  used  instead  of  derivatives  where  appropriate.  Let  x£|£+A  denote  the 
optimal  linear  estimate  of  x£  given  the  observations  up  to  t  +  A,  with 
A  >  0.  The  equations  that  constitute  the  smoother  are 


dxt  )4+A  =  dxt  +  P(t)d&  +  dP(t)&, 
the  equations  in  (22),  and 

d£t  =  -[A  +  P{t)CT<b;1C]T ttdt  -  CT<H;ldvt  +  £(t  +  A,0rCr*r+1A<ii/l+A 

where  $(t,  s)  is  the  state  transition  matrix  associated  with  the  plant  matrix 
At  =  A  —  •P(t)CT^t_1C  and  which  maps  from  time  t  to  time  s.  Let  Et|£+A 
denote  the  error  covariance  of  x(|f+&.  The  reduction  in  error  covariance  due 
to  the  lag  is  given  [23]  by 

${s,t)TcT*;1c${s,t)ds)  p(t). 


3.2  Nonzero-Mean  Marks 

The  linear  filtering  problem  becomes  more  complex  when  the  marks  of  the 
input  process  have  a  nonzero  mean.  The  complexity  derives  from  the  fact 
that  the  compensator  (i.e.  the  non-maxtingale  part)  of  the  jump  process 
becomes  nonzero  and  random.  Two  approaches  were  used  on  this  problem. 
The  objective  of  the  first  approach  was  to  derive  the  filter  directly  using 
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the  innovations  method.  The  objective  of  the  second  approach  was  to 
find  the  Gaussian  process  with  the  same  autocovariance,  and  then  write 
the  optimal  filter  for  the  Gaussian  process.  The  merit  of  the  innovations 
approach  is  that  it  produces  a  filter  expression,  which  may  be  simplified  as 
much  as  possible.  The  autocovariance  approach  is  useful  because  it  implies 
an  interesting  interpretation  of  the  compensator.  In  both  approaches,  the 
difficulty  arises  in  covariance  equations  involving  n[xt\.  Both  approaches 
are  summarized  below. 

For  the  innovations  method,  it  is  useful  to  consider  the  perturbation  of 
x*  from  its  mean  xt  =  E{xt},  denoted  by  6xt  =  xt  -  xt.  An  expression 
for  xt  may  be  found  by  taking  the  expectation  of  both  sides  of  (13).  A 
representation  for  6xt  is  then 

6xt  =  $(f  -  s)6x,  +  J'  -  r)Bu6nr  dr  +  $(t  -  r)BdRT  (23) 

where  6nt  —  n\zt]  ~  p\zt\  -Rf  Is  defined  in  (5). 

The  projection  form  of  the  optimal  linear  filter  is  [21] 

Sx,  =  f  E{6xtxJ}CT'*;ldu,. 

J  o 

Substitution  of  (23)  and  interchange  of  integration  order  in  the  6fx  term 
yields 

d  =  A6xtdt  +  Bu6ntdt  +  P(t)CT dut.  (24) 

Application  of  the  orthogonality  principle  and  some  algebra  yieldo  the  error 
covariance  equation 

P(t)  =  AP(t)  +  P{t)AT  +  BuPfx{t)  +  PliX{t)uBT 

+  -  P{t)CT*;lCP{t),  (25) 

where 

PMI(t)  =  CMI(t)  -  E{6fit6xt } 

and  C^zlt)  is  the  covariance  of  n\xt\  and  xt. 

It  is  observed  from  equation  (24)  that  for  a  recursive  filter  to  exist,  there 
must  be  a  recursion  for  6^v  However,  there  is  reasonable  doubt  that  such  a 
recursion  exists  or  is  worth  the  effort  to  derive,  since  the  dynamics  of  /x[xt] 
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are  very  complex.  The  presence  of  n[xt\  as  a  factor  in  (9)  indicates  that 
the  differential  equations  for  any  expectation  involving  n\xt\  will  depend  on 
higher  order  moments  of  n\xt\. 

In  the  second  approach,  the  differential  generator  is  applied  to  f{xt)  — 
x,  tXjt  (Xij  is  the  ith  component  of  xt )  to  get  a  differential  equation  for 
Czx(t,t)  =  E{6xt  Sxf}.  The  result  is 

Cxz{t,t)  =  ACxz(t,t )  +  Czx{t,t)AT  _ 

+  BuC*z{t,t)  +  CpZ(t,t)uBT  +  n[xt]  BBTu2.  (26) 

In  addition,  it  follows  easily  that 

~Cxx{t,s)  =  ACzz(t,s)  +  BuC*x(t ,  s) .  (27) 

Now  consider  the  Gaussian  process  xt: 

dxt  =  Axtdt  -f  B{Btdt  +  du>^) 

where  w W  is  a  scalar  Wiener  process,  and  9t  is  a  scalar  colored  Gaussian 
noise  which  satisfies 

ddt  =  FOtdt  +  Gdw j2^ 

where  is  also  a  Wiener  process.  The  covariance  of  xt,  when  separated 
out  of  the  covariance  expression  for  the  augmented  state  \xj,$t),  matches 
(26)  and  (27)  if  0  is  replaced  by  u^[it]  and  the  diffusion  of  w(l)  is  assumed 
to  be  u2  n[xt\.  There  is  also  a  match  between  the  filter  equations  for  the 
Gaussian  process  and  the  equations  (24)  and  (25),  when  the  appropriate 
notational  substitutions  are  made. 

These  similarities  imply  that  the  compensator  of  the  input  jump  process 
plays  the  role  of  the  ‘colored  part’  of  the  input  noise.  However,  for  the 
reasons  given  earlier,  the  evolution  equation  for  C>4I(t,t)  is  much  more 
complex  than  for  Cge{t,t)  in  the  Gaussian  case. 

4  Conclusions 

Martingale  techniques  have  enabled  a  rigorous  and  complete  derivation  of 
certain  representations  and  second-order  statistics  for  the  state  process  of 
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interest.  The  linear  filter  and  fixed-lag  smoother  were  given  for  the  case  of 
zero-mean  marks  in  the  system  disturbance.  In  the  nonzero-mean  case,  the 
optimal  linear  filter  did  not  seem  to  have  a  finite,  recursive  implementation. 
However,  the  form  of  the  filter  expression  suggests  that  if  a  recursive  linear 
filter  exists  for  the  compensator  of  an  arbitrary  jump  process  disturbance, 
then  a  recursive  linear  filter  may  exist  for  the  state  process. 
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Appendix 


The  following  proposition  is  similar  to  a  Lemma  of  Segall  [14,  p.  85], 
which  proves  that  a  counting  process  with  a  continuous  compensator  is 
locally  square  integrable.  Here,  the  same  property  is  proved  for  a  jump 
process  whose  underlying  counting  process  has  an  absolutely  continuous 
compensator  (i.e.  an  intensity)  and  whose  marks  are  iid  and  mean  square 
bounded. 

Proposition:  Let  Mt  be  a  jump  process  whose  counting  process  Nt  has 
an  St- intensity  such  that  At  <  K  for  all  t.  Let  the  jumps  or  marks  of  Mt  be 
independent  random  variables  with  mean  value  u  and  mean  square  value 
u2.  It  follows  from  Section  2.1  that  Rt  =  Mt  —  fo  u\,  ds  is  an  St-martingale. 
Below  it  is  proven  that  Rt  is  locally  square  integrable,  and  further,  that  Rt 
is  an  L2-martingale. 

Proof:  By  definition,  the  St-martingale  Rt  is  locally  square  integrable  if 
there  exists  a  family  of  St  stopping  times  (Tn,n  >  0),  satisfying  the  prop¬ 
erties  Tn  <  Tn+1  and 

lim  Tn  =  +oo  (a.s.),  (28) 

n— •  +  00 

such  that  for  each  n, 

Q  <  t  <  +°°- 

where  t  A  Tn  =  min(t,rn). 

Define  Tn  as  the  time  of  the  nth  jump  of  Nt.  Since  {w  :  Tn(uj)  <  t}  = 
{w  :  Nt(w)  >  n},  each  Tn  qualifies  as  an  St-stopping  time. 

To  prove  equation  (28),  we  require  only  that  E{Nt}  <  +oo  for  each 
t  >  0,  which  follows  from  the  bound  on  Xt.  Define  the  stopping  time  T  as 
follows. 

T  —  lim  Tn 

r»— »  +  oo 

It  is  observed  that  T  =  +oo  (a.s.)  iff  Pr{T  >  t}  =  1  for  all  t  <  +oo. 
Suppose  the  opposite  is  true,  i.e.  that  there  exists  a  <  +oo  such  that 
Pr{T  <  q}  >  0.  By  the  fact  that  Nt  is  increasing,  we  have  that  E{Na }  = 
+00,  which  contradicts  our  assumption,  therefore  (28)  is  proved. 
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Finally,  we  address  the  process  Rt.  In  Section  2.1,  it  was  shown  that 
Qt  =  JquX,  ds  is  the  compensator  of  Rt.  Expansion  of  .R2ATr>,  and  use  of 
the  triangle  and  Cauchy- Schwarz  inequalities  yields 

<  E{ML r„} 

+  2 

The  definition  of  Tn  implies: 

The  remaining  term  is  bounded  as  follows. 

Therefore,  E{RfATn}  <  +oo  and  Rt  is  locally  square  integrable. 

To  prove  square  integrability  of  it,  it  is  convenient  to  be  able  to  define 
the  quadratic  variance  of  Rt  without  having  to  use  the  stopping  times  {Tn}. 
The  property  we  desire  is  for  Rt  to  be  an  L2-martingale,  which  means  that 
for  each  t  >  0,  E{Rf}  <  +oo  [17,  p.112].  Since  RtArn  »s  square  integrable, 
its  quadratic  variation  exists  [19,  p.238],  and  is  defined  as  the  predictable 
compensator  for  the  quadratic  variation  [i?,  i?jtAxn.  Since  RtAT„  has  no 
Wiener  component, 

=  E 

0<j<(tAT„) 

where  the  summation  is  over  all  jumps  in  Ra  up  to  time  t  A  Tn.  Since  Qt  is 
continuous,  (ARiaT„)2  =  (A  MtATn)2.  Therefore  [#,  f2]tAr„  is  a  jump  process 
with  the  same  counting  process  as  M(Ar„,  but  whose  marks  are  the  square 
of  the  marks  of  MtArn •  We  may  deduce  from  equation  (5)  that 

<  R,R  >tATn=  /  u2\,  ds. 

Jo 

It  is  known  that  <  R,R  >iat„  also  compensates  R?ATn,  therefore 

E{RfA r„}  =  E{<  R,  R  >«at„} 

<  u2K[t  A  Tn ). 
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Taking  the  limit  of  both  sides  as  n  — ►  +  oo  yields 

E{R]}  <  u*Kt. 
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ROBUST  DESIGN  PROBLEMS:  A  CEOMETRIC  APPROACH 


Erik  I.  Verriest  and  W.  Steven  Gray 

School  of  Electrical  Engineering,  Georgia  Inatitute  of  Technology, 
Atlanta,  Georgia  30332-0250* 


Analogous  to  the  correspondence  between  observability  and  identifi¬ 
cation,  a  correspondence  r»'ating  controllability  to  a  "dual"  of  the 
identification  problem:  the  "DESICN"-prob lem  is  established.  This 
amounts  to  the  choice  of  a  realization  or  approximation  of  a  desired 
system  response,  e.g.,  in  view  of  minimizing  the  effects  of  component 
tolerances  in  analog  systems  or  finite  wordlength  effects  in  the  dis¬ 
crete  case.  A  geometric  approach  to  the  design  problem  is  presented, 
and  its  solution  given  under  a  useful  criterion  for  optimality.  For 
linear  time  invariant  systems,  the  minimum  sensitivity  realizations 
are  linked  to  the  Balanced  Realizations. 

1.  THE  PROBLEM  DEFINITION  AND  HISTORY 

This  paper  deals  with  a  new  geometric  approach  to  the  robustness  problem. 
Classically,  the  sensitivity  properties  of  a  given  realization  have  been  inves¬ 
tigated,  via  a  "sensitivity  system"  [12,3],  or  via  the  operator  form  [11]. 
The  questions  of  robustness  with  respect  to  variations  of  certain  structural 
parameters  is  closely  related  to  this  problem,  and  treated  by  Ackermann  in  [1]. 
A  geometric  point  of  view  was  recently  introduced  by  Delchamps  [2],  and  applied 
to  compensation  and  feedback.  Our  emphasis  will  be  in  optimal  implementations 
of  systems  with  quantized  or  inaccurate  parameters. 

Consider  a  linear  time  invariant  system  (A,B,C)  with  m  inputs  and  p  outputa. 
This  may  be  a  model  for  a  real  system  one  wants  to  simulate,  the  implementation 
of  a  digital  or  analog  filter,  or  an  observer-controller  implementat ing  an 
optimal  regulator  for  some  given  plant.  In  all  these  applications,  only  the 
relationship  between  the  input  and  the  output  of  the  implemented  system  is 
important.  Usually  the  so-called  "Canonical  Forms"  are  implemented  becauae 
they  minimize  the  number  of  parameters  and  allow  for  a  pipelined  realization. 
This  corresponds  to  minimal  complexity,  a  quality  that  may  be  important  if  the 
operation  count  becomes  important.  However,  a  minimal  set  of  parameters  has  no 
redundancy,  and  therefore,  high  sensitivity. 

This  paper  investigates  how  the  nonuniquenesa  of  the  state  space  realiza¬ 
tions  can  be  utilized  to  determine  optimal  parameterizations  under  various 
measures  of  "optimality"  or  robustness. 


♦This  research  is  supported  by  the  U.S.  Air  Force  under  Contract  No.  F08635-84- 
C-0273,  and  AFOSR-87-0308. 
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natural  interpretations,  Che  realization  space  is  simply  assumed  to  have  the 
structure  of  an  affine  apace  of  dimension  n(n*m*-p).  The  apace  is  given  the 
structure  of  a  Kiemnnnian  manifold  by  introducing  an  Euclidean  metric  in  the 
tangent  space  at  each  point.  For  instance,  in  the  analysis  and  design  of  the 
finite  wordlength  effects  with  fixed  point  processing,  a  uniform  metric  for  all 
tangent  spaces  is  appropriate,  whereas  for  floating  point  processing,  a  metric 
varying  smoothly  from  point  to  point  io  more  appropriate. 

This  apace  can  be  resolved  (i,e.,  partitioned  into  equivalence  classes) 
into  disjoint  seta,  corresponding  to  different  input/output  behaviors.  For  a 
particular  realization,  the  proximity  of  neighboring  sheets  will  be  an  indica¬ 
tion  for  the  robustness  or  sensitivity  of  this  realization.  These  geometric 
notions  are  made  precise  in  Section  3,  after  giving  a  more  phi loaophical 
introduction  in  Section  2  on  the  design  problem  and  its  relation  with  other 
systems  problems.  This  theory  is  applied  to  systems  design  in  Section  4.  The 
most  interesting  result  is  the  one  relating  the  minimum  sensitivity  (under  the 
fixed  point  metric)  realizations  to  the  balanced  realizations. 


2.  SITUATION  OF  THE  PROBLEM 

Consider  Che  phenomenon  "linear  system"  as  a  mapping  a  from  a  suitable 
subset  of  the  cartesian  product  of  input  functions  (U)  and  realizations  (L)  to 
Che  set  of  output  functiona  (Y).  For  continuous  linear  time-invariant  systems, 
the  mapping  stands  for  the  convolution  operator 

o  :  U  x  L  *  Y  :  (u(*),s)  ♦  y(-) 

y(t)  “  /  Ce^C  T^Bu(r)dr 

For  discrete  systems  a  similar  expression  results.  We  can  now  look  at  the 
marginal  maps  derived  from  the  linear  system  operator.  In  particular,  if 
S  “  (A,B,C)  is  fixed,  we  define  the  usual  linear  input/output  map  as 

:  U  *  (s)  ♦  Y  :  u  ( • )  -*  y  ( •  ) 

On  the  other  hand,  for  a  fixed  input  j(*),  the  marginal  maps 

Ou  :  (u)  x  L  ♦  Y  :  S  *  y(- ) 

associate  wich  each  realization  S,  e.g.  the  impulse  response  h(t)  if  u(t)  “ 
5(t),  or  the  trsnsfer  function  H(p)  characterizing  Che  steady  state  response  to 
a  sinusoid  u(t)  ■  ePc  of  complex  frequency  p. 


AVAILABLE  COPY 


BEST 


Robust  Design  Problems 


j:3 


control  ami  doconvol  ut  ion  problems  arc  Inver nc  problems  Tor  Che  map  a 
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1  'i  the  Ki'nse  chat  the  former  relates  to  tie  derivation  of  a  right -inverse  ant! 
the  latter  to  „  le  ft  - i  river  no  of  Che  map.  Moreover ,  a  certain  cnua/il  structure 

iK  implicit  in  the  problem.  In  designing  a  control  to  achieve  a  desired 

output,  invariably  "future"  actions  are  understood,  while  in  the  deconvolution 
problem  one  actr  on  observed  data,  and  thus  relates  Che  "pest"  of  u(*)  and 
y(*  ) .  Similarly,  the  construction  of  a  1  e  f  c  -i  aver  t  e  for  o  pertains  to  the 
aystern  identification  problem,  invariably  tied  to  an  observation  of  functions 
or  time  series,  and  hence  relating  the  "past"  of  y(*)  to  the  system.  Finally, 

finding  a  r i ght - inverse  of  is  the  problem  of  "designing"  a  system  with 

desired  "future"  behavior. 

In  the  identification  problem,  the  measured  data  necessarily  has  uncertain¬ 
ties  due  to  the  finite  observation  time,  finite  memory  effects,  and  imperfect 
isolation.  Similary,  uncertainties  interfere  with  the  design  problem:  the 

parameter  settings  necessarily  have  finite  precision.  In  order  Co  find 

"uniquely"  an  "optimal"  solution  to  these  problems,  one  introduces  a  suitable 
distance  or  norm  in  the  domain  and  range  spaces  [14J. 

3.  MAIN  RESUL1S 

A  summary  of  some  known  results  on  the  geometry  of  systems  and  their 
realizations  is  first  given.  The  next  subsection  discusses  the  robust  design 
on  an  abstract  level. 

3. 1  The  Geometric  Structure  of  the  Realization  Space 

Let  Lm  n  p  be  the  realization  space,  i.e.  the  space  of  all  triples  of 
matrices  (F,C,H)  of  dimension  n  x  n,  n  x  m,  and  p  x  n  over  R.  Endow  this  space 
with  an  affine  structure  with  vector  space  gr>(m+r'+p)  ,  Hence,  at  each  point  S, 
there  is  an  attached  vector  space  T<.L  (the  tangent  space  at  S),  isomorphic  to 
Rn(rrr+n  +  p).  The  group  GI  n  ( R )  sets  di  f  ferenciably  on  the  right  to  L^  n  p,  via 
(A ,  B ,  C)  *  (A,8,C)T  ■  (TAT'^TB^T-*)  corresponding  to  a  change  of  base  in  the 
state  space  z  ■  Tx .  The  quotient  space  ia  non-Hausdor f f  in  general. 
Restricted  to  the  completely  reachsble  (or  dually,  the  completely  observable) 
systems,  Che  action  of  Gln(R)  is  free  (as  a  consequence  of  reachability/ 
observability)  and  the  quotient  apace  (set  of  orbits)  MCr  m  LCr  /Gl  (R)  is 
a  smooth  (real)  analytic  manifold  (hence  Hausdorff)  of  dimension  n(m+p).  '  The 
set  of  equivalence  classes  of  minimal  realizations  are  analytic  open 

submanifolds  [5].  This  space,  called  parameter  space,  is  crucial  in 
identification,  and  is  well  studied  (e.g.  in  relation  to  the  (non)existcnce 
of  continuous  canonical  forma  15),  and  degeneration  phenomena  (6)). 

Since  the  isotropy  subgroup  is  trivial  for  all  reachable  or  observable 

realizations,  Its  dimension  Is  constant  on  lC0,Cr,  and  hence,  the  orbita  of 

m,n,p’ 
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form  a  foliation  F  of  l,c  '  ’  '  ‘  of  dimension  n(,r.+  p)  [91.  Th-  u«M  of 

m ,  n  ,  p  '  ' 

tangent  spaces  to  the  loaves  form  nn  n  (mrp)  -d  imens  ion.i  1  subbund  1  e  t(f)  of 
the  tangent  bundle,  called  the  tangent  bundle  to  F ,  l";i e  quotient  bundle  v(F)  " 
TL/t (F)  ta  called  the  normal  bundle  to  F. 

Our  interest  13  not  in  the  universal  parameterization,  but  in  the  orbits 
under  the  action  of  Cl  (R)  itself.  These  orbits  are  open,  and  the  boundary 
points  of  reach,  >le  realizations  are  nonreachable  realizations.  The  explicit 

form  of  Che  closure  of  the  orbits  was  addressed  ir.  [8].  We  shall  endow  the 

co  c r  .  ,  ,  , 

tangent  bundle  TL  *  with  a  positive  definite  metric 
ra ,  n ,  p 


:  TSL  x  TSL 


for  all  S  in  L 


3 . 2  The  Robust  Design  Problem:  A  Geometric  Approach 

Before  proceeding  with  our  system  design,  we  shall  prove  a  general  result  on 
sensit ivity : 

De  f  i  ni  t  ion :  Let  0  be  an  N-dimensional  open  subset  of  an  affine  space  of 

design  parameters  (configurations).  By  an  Observable,  we  shall  mean  any  smooth 

function  f  :  0  +  R  which  has  no  critical  points. 

Any  two  configurations  0^  and  0^  in  the  parameter  space  are  indiscernible  by 

observation  of  f  if  f(0j)  ■  f  ( 0  2 )  *  This  allows  ua  to  regard  two  parameteri- 

zationa  yielding  the  same  observable  value(s)  aa  being  the  aame  (or  equivalent) 

for  aome  purpoae.  In  a  systems  context,  observables  are,  for  instance,  a 

mapping  from  the  realization  space  to  Che  transfer  function  (scalar  caae) 

evaluated  at  a  particular  frequency,  or  the  impulse  response  evaluated  at  a 

specific  instant,  i.e.,  Che  "system  functions"  [3). 

An  observable  induces  a  partition  of  0  into  equivalence  classes,  known  as  a 

foliation.  In  this  case,  the  submanifolds  tre  the  level  surfaces  of  f,  and 

have  dimension  N-l.  There  exists  a  vector  field  normal  (in  terms  of  some 

arbitrarily  chosen  Rieraannian  metric)  Co  the  leaves. 

The  whole  issue  of  the  sensitivity  problem  is  now  to  find  the  poincs  on  the 

leaves  corresponding  to  a  maximal  "separation"  of  the  leaves  of  the  foliation. 

3.2.1  Riemannian  Metrics.  If  9  is  psracompact,  then  a  Riemannian  structure 

G  can  be  put  on  0  (or,  more  exactly,  on  its  tangent  bundle).  This  means  Chat 

for  each  9  c  0,  a  symmetric,  positive  definite  bilinear  form  Cg  is  defined  on 

Che  vector  apace  T.O ,  such  that  C  defines  a  metric  or.  TO,  i.e.  is  a  smooth 
y  o  9  * 

section  of  the  vector  bundle  T_0.  Let  :  T  0  ♦  TO  be  the  natural  iaoraor- 

*  .  ‘  , 
phism  of  each  space  TgO  with  TgO.  If  f  is  a  smooth  map,  the  gradient  of  f  is 

defined  as  the  element  d£*  of  TO  (i.e.  the  vector  field  corresponding  under  the 

map  *  to  the  differencial  form  df).  In  the  local  coordinates,  this  is  given  by 
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wh<‘rL‘  Che  ntnnniJic  ion  convention  is  used.  The  matrix  i 


3  t he  inverse  of  the 


metric  Censor  i g .  .  j 


«ij  -  cC-V^) 

J  30  30J 


The  squared  norm  of  Che  gradient  ia 

17  fl2  -  G(7  f.V  f)  -  glJ 
G  G  G  30 1  36J 

If  0  ia  foliated  by  f,  Chen  Che  cangent  space  4^  to  the  leaf  through  0  ia  an 
N-l -d imena ional  subsoati  of  Tfl9.  ff 

3.2.2  extremal  Sensitivity  Theorem.  Point*  of  extremal  sensitivity 

(with  respect  to  an  observable  f(9)),  are  determined  by  minimization  of  L(0)  “ 

1  2 

— IV  fi  over  the  leaf  characterized  by  a  particular  value  of  the  observable  f. 

2  G 

A  worst  case  analysis  leads  to  the  minimization  of  the  gradient  norm  IV  fl  " 
t/2  0 

G (V  f  , V  f )  ,  or  equivalently, 

G  G  -  1  2 

h  -}ivcf.2 

This  scalar  field  induces  a  vector  field  in  the  tangent  space  A  of  the  leaf. 
However,  note  chatdh  *  dC(df  ,df  )  is,  in  general,  not  tangent  to  the  leaf. 
Its  projection  on  the  tangent  space  to  the  leaf  at  0  yields  the  tangent  vector 

dCfdp'.d  -  Xdf^  to  the  leaf  through  0,  for  some  X  c  R. 

Theorem  I :  If  f  is  an  observable  for  the  parameter  space  (0,G),  then  the 

points  of  extremal  sensitivity  with  respect  to  f  are  implicitly  determined  by 
the  equation 

dC(df#,df*)*  -  Adf*  -  0 

Proof:  TVe  stated  condition  is  the  Eu  ler-Lagrange  equation  for  the  constrained 

optimization  problem. 

The  gradients  of  T  and  f  are  aligned  at  the  extremal  sensitivity  points.  In 

particular,  for  the  uniform  metric,  g-j  ■  the  condition  specializes  to 

(W*)  '  XI)f0(,) " 0 

while  for  Che  relative  metric  gjj  •  d.j/0^0^,  which  is  useful  in  connection 
with  the  floating  point  arithmetic,  Che  condition  is 


[  diag(0  )diag(  ffl)  ♦  di  ag  (02 )  f..  ]d  iag(0  2 )  f  -  Xdiag(02)f 


BEST  AVAILABLE  C  Ti  >'i 


t.f.  I'crrtcst  and  tv'.  ,V.  (Jr:- 


In  th^  latter  case,  a  simpler  lor«n  ia  obtained  by  usinj.  the  r,e  r  a ! 

gradient  7f  with  component  a  O.Sf/30.  instead;  corresponding  to  tY>  ir.-y.-,,' 

Hessian  K  »  diag(Vf)  +  d  i.i8(0  )  f^diagO  ) .  We  state  what  was  just  ah-.wn 
itnporc  ant 

CoroU^rv:  The  extremal  sensitivity  points  of  (0,0,  where  G  is  the  unifo.-i 

relative  metric,  are  the  points  where  the  gradient  d C*  is  in  Che  eigenspac 
the  generalized  Hessian  operator  H  :  T^G  *  “  o ,  i.e. 

( H (  f )  -  ,\l)df#  -  0 


4.  APPLICATION  TO  ROBUST  REALIZATIONS 

Express  the  parameterization  in  terms  of  the  components  of  a  factorizat 
of  the  ayatem  Hankel  matrix  H  -  OR,  where  0  and  R  are,  reapectively , 
obaervability  and  reachability  matricea  of  the  realization.  The  Hankel  mat 
defined  as  a  map  with  domain  plays  the  role  of  a  mul  t  id  imens  in 

observable.  The  continuous  time  systems  design  under  the  uniform  metric 
discussed,  for  square  (p  »  m)  systems  only. 

De  f in i cions :  Let  L^fO,")  be  the  Hilbert  space  of  m-vector  functions  with  in 
product  <x(0,y(*)>  -  /*  x(t)'y(t)dt.  The  reachability  operator  £  : 

Rn  for  a  realization  (A,B,C)  is  defined  by  Ru(t)  -  f  eAtBu(t)dt.  Its  adjc 
1*  ia  the  operator  R*  :  Rn  +  :  R*x  -  B'e^x.  The  observabili 

operator  is  £  ;  Rn  *  lJ[0,-)  :  £x  -CeAcx.  Since  R_  and  £  have  a  fin; 

dimensional  range  and  domain,  respectively,  they  are  compact,  and  the 
composition  £R  is  also  compact  [7].  Finally,  we  introduce  the  Hankel  opera 

£  :  LjtO,-)  *  l£[0,»)  :  £u(t)  -  J*  h(  t+T  )u(t  )dr ,  where  h(t)  -  CeAcB.  It 

readily  verified  that  indeed  H  ■  OR.  An  operator  A  :L°(0,-)  ♦  l”[C 

satisfying  AA  ■  A  A  ■  Id  (the  Identity  operator)  ia  called  isometric, 
shall  also  assume  Chat  the  set  {e.}^  is  the  standard  basis  for  Rn  and  t 
the  functions  { ^ ^ ^  form  s  complete  orthonormal  basis  in  L”tO,«). 


~  ’  L  h«vl*u><*vl  -  "  [j  rijlei><*jl  9.  m  ^  Okil+feX-J 

The  matrix  representations  [h£j|,  (r^J,  and  (o^)  will  be,  respective 
denoted  by  M«t(£),  Hit(R),  and  Mat(O).  By  Vec(M),  we  mean  the  vector  formed 
stacking  the  elements  of  the  matrix  M  columnwise. 

It  is  now  possible  to  state  our  first  auxiliary  result: 

j££2£.:  Lec  E  :  l-j  *  L®[0,«)  be  such  chat  TrAE  -  0  for  all  iso*sef 

operators  A,  Chen  E  ■  0. 

Froo f :  Suppose  l  has  the  singular  value  decomposition  [7,  p.  2611 


"  l.  rij!'i><*jl 


BEST  AVAIL 


L fj  .  Ui.Xy,  I 
i  '  1  l  1 


are  nonnegative,  ve 


where  [u.f  and  { v .  ^  are  orthonormal  sets  in  then  choosing  A  as 

l  l  '  2 

IJvjXu  J  yields  E1K  *  0.  Since  Che  singular  values  0.  are  nonnegative,  ve 

oust  have  all  Q.  "  0  and,  hence,  E  “  0.  3 

In  order  Co  apply  the  theory  developed  in  the  previous  section,  we  consider 
the  a.'fine  space  formed  by  the  matrix  elements  of  C  and  R,  ao  that  the  param¬ 
eter  vector  is  O'  “  [  V_  c  (Mat  (£)  ' )  '  ,  Vec  (  Ma  t  (£) )  1  ]  .  Analogous  to  the  discrete 

case  [A],  ve  shall  consider  the  observables:  f .  (0 )  "  TrA(H-OR).  Denote  by 

m  if,)  the  leaf  on  which  f,  is  constant,  zero  sav,  then  ve  have  the: 
o  A  A  ' 

Theorem  2:  The  extremal  aensitivity  points  of  M  ( f .  )  have  the  property  that 

;  ~  °  A 

RR  -  ££.  | 

Proof:  Subatitute  the  bra-ket  expansions  in  the  expression  for  the  observable 

f(9),  and  use  the  orthonormal ity  of  the  bases.  This  reduces  the  continuous 

time  problem  to  Che  matrix  problem,  solved  in  [4],  where  it  was  shown,  based  on 

Che  Corollary  to  Theorem  1,  chat  the  extremal  sensitivity  points  satisfy 

MatCR)Mat(R)  '  -  Hat (0) 'Mat (0) 

Expressing  Mat  (£)Mat  (R_) '  and  Mat  (£) 'Mat  (£)  in  Che  basis  {e.}"^  gives  then  the 

it 

condition  in  terms  of  the  original  operators:  _R£  *  £  £. 

Corel  1  ary :  The  minimal  sensitivity  realizations  on  the  Cl  (R)  orbit  of  a 
minimal  realization  of  M  are  the  essentially  balanced  (i.e.  balanced  modulo  an 
orthogonal  t rana format  ion)  realizations. 

Proof:  Observe  first  that  the  condition  for  an  extremum  did  not  depend  on  the 

choice  of  A,  and  therefore,  must  be  true  for  all  isometries,  or  observables  f, 

A  . 

All  extremal  sensitivity  points  of  f^  belong,  therefore,  to  the  intersection 

n.M  (f,).  By  the  lemma,  Che  intersection  of  the  manifolds  H  ( f ,  )  is  Che 
A  o  A  o  A 

submanifold  characterized  by  £  *  OR,  i.e.  Che  orbit  of  the  system  with  Hankel 
operator  £  under  Che  action  of  Cl  (R).  Then,  by  Che  previous  theorem,  RR  “ 
£  £  ao  that 

<x,j£*y>  -  <x,0*0y>  Vx,y  t  r" 


which  leads  to  the  equality  of  Che  Reachability  and  the  Ob aervab i 1 i t y  Gramian. 
Realizationa  having  Chia  property  are  essentially  balanced,  as  an  orthogonal 
similarity  transformation  will  make  chem  truly  balanced  (equal  and  diagonal 
gramians)  [10, l A ] .  The  second  variation  property  shows  that  the  extremal 
aolutiona  obtained  correspond  to  minimum  sensitivity  solutions.  Finally,  all 
infinitesimal  variationa  in  the  parameters  of  the  factorizations  of  the  Hankel 
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matrix  lead  co  second  order  variations  in  H.  But  small  (first  order)  varia¬ 
tions  in  the  reachability  and  observability  matrices  are  themselves  linked  to 
first  order  variations  in  the  realization  parameters.  | 

As  shown  by  this  corollary,  it  suffices  to  find  an  essentially  balanced 
realization.  The  characterization  as  a  factorization  of  the  Hankel  matrix  is, 
therefore,  independent  of  the  size  of  the  Hankel  matrix  considered,  as  long  as 
it  specifies  the  given  input/output  relation. 
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ABSTRACT 

This  paper  discusses  sods  aspects  of  the 
design  problem  Involved  in  the  choice  of  a 
reallration  or  approximation  of  a  desired  system 
behavior  (as  for  Instance  dictated  by  the 
analytical  solutions  to  a  filtering  problem)  by 
parameters  that  can  only  be  approximately 
adjusted,  e.g.,  due  to  quantisation,  component 
tolerances  (analog  case)  and  finite  vordlength 
(discrete  case).  The  paper  first  eddresses  the 
mathemstical  characterisation  of  this  robustness 
problem,  and  its  solutions  under  various  criteria 
of  optimality.  Earlier  results  are  here  extended 
to  multi-mode  systems  which  can  arise  in  non¬ 
linear  approximation  problems.  The  feasibility 
of  this  approach  in  multi-mode  filtering  is 
shown,  and  is  illustrated  by  an  air-to-air 
tracking  example. 

1.  INTRODUCTION 

The  air-to-air  target  tracking  problem  is 
highly  nonlinear  because  of  the  nonlineer 
relations  between  measurements  and  dynamical 
states,  and  the  different  flight  regimes  that 
occur.  Differences  in  Hach  number  and  or 
geometry  of  the  target  Induce  large  changes  in 
the  dynamical  model.  A  good  knowledge  of  the 
dynamical  model  is  primordial  to  the  design  of 
good  tracking  filters,  as  the  predictive  behavior 
of  the  filters  are  determined  by  the  dynamics  of 
the  system.  Mach  number  changes  with  air 
density,  hence  eltitude,  and  velocity,  and  is 
therefore  coupled  to  the  position  and  momentum  of 
the  target.  These  are  of  course  state  components 
of  direct  interest.  As  the  target  is  maneuvring, 
perhaps  beyond  the  anticipation  of  the  tracker, 
its  trajectory  is  modeled  as  a  smooth  stochastic 
process,  the  statistics  of  which  are  clearly 
dependent  again  on  its  position  and  momentum,  as 
well  as  the  geometry  of  the  vehicle. 

This  paper  investigates  the  tracking  problem 
under  the  assumptions  that  the  data  sampling  rate 
is  sufficiently  high.  This  implies  small 
Increments  (as  compared  to  the  sixes  of  the 
domains  in  the  different  flight  regimes)  in  the 
state  variables  from  one  sample  to  another,  so 
that  the  same  flight  regime  can  be  assumed  over  a 
large  number  of  samples.  Under  this  condition 
the  system  is  reasonably  wall  approximated  by  a 
piecewise  affine  stochastic  system  [1].  The 
transitions  from  one  flight  regime  to  another  is 
determined  by  the  state  vector  itself.  Given 
enough  (good)  samples,  the  state  estimator  will 


have  a  good  performance  in  each  domain.  At  the 
transients  from  one  domain  to  another,  unmodeled 
uncertainty  would  be  introduced  because  of  the 
mismatches  of  the  updates  near  the  boundaries.  A 
good  filtering  schema  needs  likelihood  type 
methods  as  developed  in  [2-3]  to  deal  with  this 
additional  uncertainty.  This  typically  leads  to 
a  filter  composed  of  a  parallel  bank  of  Kalman 
filters,  together  with  a  likelihood  updating 
scheme.  In  this  paper,  the  assumptions  ensure 
that  the  time  spent  in  these  transition  regions 
is  relatively  email  statistically  speaking.  We 
simply  propose  to  artificially  reset  the  state 
covariance  whenever  e  state  domain  transition 
occurs.  This  simplifies  the  filter  from  the 
parallel  bank  plus  likelihood  estimator  to  a 
simple  sequentially  switched  estimator. 

The  new  feature  of  this  paper  is  the  optimal 
implementation  of  such  a  sequentially  switched 
filter  from  the  point  of  view  of  parameter 
sensitivity.  Section  2  describes  some  typical 
problems  in  the  implementation  of  systems,  l.e. 
the  minimisation  of  the  effects  of  component 
tolerances  for  analog  systems,  and  the  finite 
vordlength  effects  in  digital  systems.  In 
section  3  the  results  ere  extended  to  a  more 
general  type  of  systems:  the  switched  systems  and 
piecewise  linear  systems.  Finally  in  section  4, 
the  air-to-air  tracking  example  is  discussed. 

2.  OPTIMAL  IMPLEMENTATIONS  FOR  LINEAR  MODELS 

This  work  builds  on  the  earlier  work  on 
robust  design  problems  [4-5].  Consider  a  linear 
time  invariant  system  (A,B,C)  with  a  inputs,  p 
outputs  and  McMillan  degree  n.  This  may  be  a 
modal  of  a  system  to  be  faithfully  simulated,  the 
implementation  of  an  analog  or  discrete  filter, 
or  an  observer-controller  implementing  an  optimal 
regulator  for  a  given  plant.  As  in  all  these 
applications,  not  the  actual  state  coordinates, 
but  the  input-output  relation  is  important,  they 
are  usually  implemented  by  a  so-  called  canonical 
fora.  The  reason  for  this  is  that  these  imp- 
lamentations  minimite  the  number  of  parameters, 
and  allow  a  pipelined  realisation  of  the 
devices,  e.g.  the  "Direct  Fora"  realizations  in 
digital  signal  processing.  A  minimal  number  of 
parameters  corresponds  to  minimal  complexity,  an 
important  quality  if  operation  count  becomes 
important.  However,  a  minimal  set  of  parameters 
has  no  redundancy,  and  therefore  one  may  expect 
high  sensitivity  with  respect  to  these  para¬ 
meters  .  It  is  clear  that  the  freedom  of  coor¬ 
dinate  basis  of  the  implementation  should  be 
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utilised  to  determine  optimal  realisations  under 
various  criteria  for  optimality.  In  particular, 
two  Issues  seem  to  be  Important:  sensitivity  and 
clustering.  The  sensitivity  requirement  guaran¬ 
tees  robustness  of  the  actual  Implementation, 
while  clustering  deals  with  the  parameter  ranges. 
It  relates  to  the  problem  of  approximately 
implementing  a  certain  system  with  parameters 
chosen  from  a  finite  set  with  fixed  values. 

The  approach  taken  in  (4]  is  geometric.  The 
realisation  space  _  is  modeled  as  *■' 
n(n+m+p)  dimensional  'affine  space  with  an 
Euclidean  metric  metric  defined  in  the  tangent 
space  at  each  point.  The  Extremal  Sensitivity 
Theorem  assarts  that  the  minimum  sensitivity 
points  of  an  observable  are  the  points  where  the 
(generalised)  gradient  is  in  the  eigenspace  of 
the  (generalised)  Hessian.  In  the  case  of  fixed 
point  implementations,  the  uniform  Euclidean 
metric  is  appropriate,  and  the  gradient  and 
Hessian  correspond  with  the  the  usual  notions  in 
calculus.  All  results  are  therefore  also 
"infinitesimal*''.  One  can  reasonably  so  argue 
that  in  finite  wordlength  arithmetic,  the  notions 
of  infinitesimal  perturbations  do  not  apply,  as 
all  perturbations  are  due  to  the  truncation  of 
the  coefficients.  This  may  indeed  invalidate  the 
above  mentioned  method.  For  this  reason,  we 
shall  develop  the  analysis  for  non -Infinitesimal 
perturbations  in  this  paper.  It  will  be  shown 
that  this  approach  makes  a  connection  with  the 
notion  of  clustering.  In  this  section,  by  finite 
we  shall  mean  non- infinitesimal.  By  L  we  denote 
a  particular  realization  in  n  _,  and  by  M  the 
equivalence  class  of  all  similar  realizations 
having  a  particular  input-output  behavior,  i.e. 
the  orbit  i~^(L)  under  Gl^iR).  The  equivalence 
class  will  be  referred  to  as  "system”,  the  orbit 
space  is  denoted  by  „,  and  the  projection 
map  from  -  to  n  j  by  w.  Two  problems 
related  to  tne  sensitivity  and  robustness  are 
studied. 

Problem  A:  Discrimination:  How  do  we  choose 
a  realization  of  a  given  system  HQ  such  that  it 
is  maximally  distant  from  the  orbit  x~l(M),  where 
M  is  another  given  system. 

Problem  B:  Worst  Case  Defects:  If  the  system 
parameters  are  perturbed  over  a  fixed  non- infini¬ 
tesimal  amount,  and  if  f  is  a  scalar  system 
function  (i.e.  Invariant  under  similarity)  then 
find  the  realizations  L  of  M0,  for  which  the 
f-perturbation 

max  {  f(L+A)  -  f(L)  )  where  CA  ■  {  A  |  141  ■  1  } 
is  minimal. 

In  both  problems,  the  norm  function  in  the 
realization  space  will  be  fixed  to  be  the  Eislng 
norm  (compatible  with  the  uniform  metric). 

dK2  -  Tr  {  AA*  +  BB*  ♦  C'C  } 

It  follows  then  that  if  (A,B,C)  is  a  solution  to 
each  of  the  above  problems,  then  also  any 
realization  obtained  from  (A,B,C)  by  an  ortho¬ 
gonal  similarity  transformation  is  also  a 
solution.  A  third  problem,  related  to  problem  A, 


is  now  introduced: 

Problem  C:  Clustering:  find  the  realizations 
L  of  M  with  minimal  system  (Eislng)  norm. 

This  has  the  physical  significance  that 
cooperatively  the  components  of  the  realization 
are  as  small  as  possible,  hence  clustered  near 
zero.  It  is  a  special  case  of  the  more  general 
(and  more  significant)  clustering  problem.  Let 
r  ■  Yh)  be  a  finite  subset  of  R,  then  we 

formulate 

Problem  C‘:  T-Clustering-  Find  the  realiza¬ 
tions  L  of  M  with  component  values  closest  (in 
the  Elslng-norm  Induced  metric)  to  the  set  T  « 
(Yi . Yh)* 

Problem  C  corresponds  then  to  T  ■  (0).  It  is 
also  helpful  to  define  a  bilinear  map  on  : 

II...]]  -  L»,n>p  x  LBtn>p  — -  Rnxn 

U(A,B,C),(F,G,H)]J  -  [A'.F]  -  G  B*  +  C'  H 

where  l . , .  ]  is  the  usual  Lie  product : 

[A.F]  -  AF  -  FA 

Theorem  1:  The  class  of  optimally  clustered 

realizations  coincides  with  the  class  for  which 
[[L,L]]  vanishes. 

Proof :  If  (A,B,C)  is  constrained  to  realize 
H,  then  if  (Aq.Bq.Cq)  is  a  representant  for  H,  we 
get  a  constrained  optimization  problem,  for  which 
the  Hamiltonian  is 

H  -  Tr  {  AA*  ♦  BB*  +  C'C  ♦  AA(TA0-AT}*  + 
AB{TB0-B)'  ♦  A^Co-CT)  } 

The  optimality  conditions  lead  then  directly  to 
the  stated  result. 

Not  every  system  allows  an  optimally  cluster¬ 
ed  realization.  A  counter-example  can  be  found 
which  is  based  on  the  phenomenon  that  the  orbits 
under  similarity  are  not  closed  [6]. 

With  this  solution,  we  can  show  the  follow¬ 
ing: 

Theorem  2:  The  realization  of  M0  which  has 

maximal  Eislng  distance  to  the  orbit  of  is 
implicitly  given  by  L*  c  s*l(M0),  satisfying 

( [4* ,Le] ]  -  0 

4  optimally  clustered 

where  £  is  the  perturbation,  d.(L*,Le)  - 
with  L  e  s~HM;).  The  maximal  distance  is  then 
tbs  Bising  norm  of  4. 

Proof:  This  follows  easily  by  solving  the 

minimax  problem:  First  determine  for  e  given 
realization  L,  the  point  L*  on  the  orbit  of 
for  which  theEising  distance  dg(L,L*)  is  minimal. 
This  problem  has  always  a  solution,  but  the 
realization  L*  may  not  be  unique.  The  proof  is 
similar  as  in  the  clustering  theorem.  The 
condition  is  [[L,L-L*]J  •  0.  Next  we  slide  L  on 
its  orbit,  the  associated  realization  L*  will 
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clearly  vary  with  L.  so  that  we  nay  define  a  nap 

r.  a'^Hp) - *  e'HHi)  »  L - ►  L*.  Nov  find 

tha  similar  realisation  La,  for  which  dg(L*,L  ) 
is  maximal.  The  realisation  L*  is  the  cor¬ 
responding  point  (L*)tf.  This  constrained 
optimisation  oroblem  yields  tha  additional 
condition  [IL-L».L-L*]J  -  0. 

There  remain  some  open  problems.  It  is  not 
dear  whether  or  not  the  orbits  can  diverga,  in 
tha  sense  that  the  optimum  may  be  on  the  closure 
of  the  orbits,  and  therefore  not  attainable. 
Also,  if  a  solution  exists,  it  may  not  be  unique 
(modulo  0(n)).  The  example  of  x*l(0,l,l)  and 
,*1(1, 1,1)  illustrates  that  to  every  L  in  the 
first,  a  corresponding  L^  on  the  other  orbit 
exists,  for  which  the  distances  are  constant  and 
equal  to  1.  It  is  also  natural  to  look  at  tha 
extension  of  problem  A: 

Problem  A*  s  Multimode  Discrimination:  Cj.ven 
the  orbits  e'ML^),  find  the  realisations  L^  on 
each  orbit  such  that  the  set  {L*  )  is  maximally 

separatad. 

This  is  of  interest  for  realizing  multi  mode 
systems.  The  practical  significance  of  all  this 
is  that  the  realizations  with  the  largest 
intraset  distance  are  the  most  robust  with 
respect  to  parameter  inaccuracy,  as  for  Instance 
due  to  coefficient  truncation.  The  problem  will 
be  discussed  in  section  3.4. 

As  to  problem  B,  we  shall  Just  state  the 
following  results  for  the  scalar  observable 
fs  _  — ►  R  ,  which  in  fact  only  gives  an 

Implicit  solution  to  problem  B8 

Theorem  3;  i)  Let  the  realization  L  be 
given.  The  deviation  f(L+A)  -  f(L),  is  extremal 
if  the  perturbation  A  is  in  the  direction  of  the 
gradient  of  the  observable  f,  evaluated  at  L+A. 

li)  If  only  the  system  M  is  given,  i.e.  the 
orbit  e"l(L),  then  the  deviation  f(L+A)  -  f(L)  Is 
extremal  at  L*  if  the  perturbation  A*, 
grad  f(L*+A*)  and  grad  (Le)  are  all  aligned. 

Proof;  Again  this  follows  simply  from 
adjoining  the  constraints  IAI2  ■  1  and  for  part 
li)  also  f(L)  •  c,  with  Lagrange  multipliers  A 
(and  u  for  il)  )  to  the  performance  function 
f(L+A)-f(L).  The  optimality  conditions  are 


3.1  Switched  Systems 

The  system  is  assumed  to  be  modeled  by 

*k+l  ■  A[k]*k  +  Blk)«k 

?k  *  c{k)*k 

where  [k]  c  (1 . N)  is  the  function  determining 

the  mode  switched  on  at  time  k.  we  assume  that 
this  switching  la  state- independent,  but  other¬ 
wise  a  purely  deterministic  (sequence,  or  a 
random  time  series.  A  theory  for\  deterministic 
discrete  time  periodic  systems  was  developed  in 
[7].  Bare  we  allow  general  time  variation,  thus 
not  necessarily  periodically  switching  sequen- 
cas.  In  the  randomly  switched  case,  we  assume 
that  the  statistics  are  stationary  and  known. 
The  domain  of  validity  of  each  mode  is  the  entire 
state  space.  This  la  in  contrast  with  the  next 
class. 

3.2  Piecewise  Linear  Systems 

This  is  a  multi-mode  system  as  described 
above,  but  the  domains  for  the  validity  of  each 
mode  partitions  the  state  space,  i.e.  they  form  a 
"patchwork"  which  pieces  together  a  single 
"global"  system.  Clearly,  one  can  think  of  such 
a  system  as  a  switched  system  with  the  switching 
completely  determined  by  the  state  xk  of  the 
system.  Such  a  model  results  for  Instance  in  the 
approximation  of  a  nonlinear  system  by  a  piece- 
wise  linear  one  [1].  Despite  Its  local  linear¬ 
ity,  the  dynamical  behavior  of  such  a  system  can 
be  very  complex  and  sustain  chaotic  motion  [1]. 

3.3  Robust  Dealtn  Problems  for  Multi-Mode  Case 

As  usual,  the  problem  is  to  design  an  optimal 
Implementation  of  the  multi-mode  system. 
Characteristic  for  the  multi-mode  systems  is  the 
fact  that  the  state  is  communicated  from  one  mode 
to  tha  other.  This  implies  that  despite  the  fact 
that  each  mode  separately  can  be  realised  in  many 
different  ways,  the  total  system  is  only  left 
invariant  under  the  action  of  Gl^R),  and  not 
Gln(R),i,  where  N  is  the  number  of  modes.  Hence  a 
straightforward  optimization  by  realizing  each 
mode  in  tha  optimal  way  (i.e.  essentially 
balanced)  will  only  be  valid  if  a  state  trans¬ 
formation  is  performed  at  each  mode  transition. 
He  formalise  this  as 


grad  f(8+d)  +  AA  •  0 

grad  f(8+A)  +  (ii-A)  grad  f(B)  ■  0. 

Remark:  The  infinitesimal  result  of  (4]  is 
recovered  for  the  uniform  metric  if  the  pertur¬ 
bation  becomes  infinitesimally  small. 

3.  MULTI -MODE  SYSTEMS 

Two  types  of  models  with  many  similarities 
are  discussed;  Switched  parameter  linear  systems 
and  piecewise  linear  systems.  Each  "moda"  of  a 
system  will  be  denoted  by  a  triple  of  systems 
functions  (A^.B^.C^).  He  shall  also  assume  that 
the  number  of  different  modes  is  finite,  N. 


Theorem  4s  Defining  the  observable  for  the 
multimode  system  as  a  weighted  average  of  the 
observables  in  the  single  modes,  the  optimal 
unconstrained  realization  is  obtained  by  realiz¬ 
ing  each  mode  individually  in  essentially 
balanced  form. 

Proof;  The  observable  is 

f  -  Tr  E  e^Ag(B^-OgRg) 

which  is  a  weighted  sum  of  tha  observables 
defined  in  (4-5)  for  single  mode  systems.  The 
parameterization  is  with  respect  to  the  com¬ 
ponents  of  the  observability  matrices  Oj,  and  the 
reachability  matrices  Rj.  The  gradients  are 
linear  in  these  parameters,  and  the  Hessian 
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eiganproblem  therefore  dacouplaa  into  tha 
Individual  components,  giving  tha  simple  condi¬ 
tion  Oj'O^  “  EiRi',  expressing  aaaantial  balance- 
dnasa  of  all  aoda  realisations. 

If  all  sodas  communicats,  this  naans  that 
H(N-l)  transformation  matrices  naad  to  ba  storad. 
This  additional  computation  lnducas  also  in¬ 
accuracies,  and  may  tharafora  upset  tha  optimal¬ 
ity  for  that  schema.  Ua  present  hare  tha  more 
diract  approach  by  choosing  tha  optimality 
criterion  as  a  weighted  version  of  tha  objective 
in  each  mode.  Tha  weights  {s^}  are  most  reason¬ 
ably  sat  equal  to  tha  relativa  time  spent  in  each 
mode,  from  our  assumptions  these  relative  times 
are  pracomputable. 

Theorem  5;  Let  tha  relative  time  spent  in 
mode  i  ba  v^,  then  the  constrained  minimal 
sensitivity  realisations  are  given  by  the 
essentially  balanced  multi-mode  systems  defined 
by  tha  requirement: 

I  0i‘0i  -  £  Si2  R iV 

1  1 

Proof :  Follows  directly  from  the  &ST,  using 
the  observable 


realisations  Lj  on  each  orbit  such  that  the  sat 
(L  is  maximally  separated. ^,,---Tha  practical 
significance  of  all  this  is  that  the  realisations 
with  the  largest  Intraset  ,/distance  are  the  most 
robust  with  respect  to  /parameter  inaccuracy,  as 
for  Instance  due  to  coefficient  truncation.  Ue 
give  a  constructive  solution  of  problem  A'  in  the 
unconstrained  case,  ^.e.  when  the  states  do  not 
necessarily  have  to  cooounlcate  directly,  and 
transformations  are  allowable  at  each  mode 
transition.  j 

1.  For  each  realisation  L  on  Mq,  determine 
realisations  L^  on  the  orbits  of  tha  other 
modes  for  which  dp(L.Lj)  is  minimal.  Let  the 
minimal  distance  be 

2.  Determine  A(L,(H})  -  min  {  4^^:  1-1,..., N 

}.  Note  that  this  dlsance  does  no  longer 
vary  smoothly  as  L  moves  on  Mq. 

3.  Determine  L*  on  Mj  such  that 

A(L*,{M))  -  max  {4(L,{M})t  L  realises  H^. 

4.  Perform  steps  1-3  for  each  of  the  modes  Mj, 
to  find  tha  optimal  realisations  Lj*. 


*i  “  ®  Tr  A^CH^-O^R) 
i 

with  the  constraints: 

Oj  -  01°T*1  and  RA  •  TRj0 

where  Rj0  and  0*°  are  respectively  the  reach¬ 
ability  and  observability  matrices  for  a  nominal 
realisation,  and  T  is  the  Gl^R)  element  to  be 
determined,  l.e.  the  parameterlsatlon  for  the 
problem.  First  the  gradients  of  the  observable 
with  respect  to  R^  and  0^  are  computed,  and  the 
constraints  are  substituted.  Noting  that  the 
time  vi  spent  in  mode  EA  does  not  depend  on  the 
realisation  of  that  mode,  the  gradient  components 
are  readily  obtained, 

a0i  "  "i™!0*! 

8Ri  "  ,iAi°i°T~l 

Minimisation  of  the  norm  of  the  gradient  with 
respect  to  T  yields  then  the  condition 

TPT*  -  T^qT"1 


where  we  used  the  fact  Aj A^  -  A^'A^  -  I,  and 
defined  the  generalised  gramians,  weighted  by  tha 
sojourn-times  as 


P  -  £  Vj2  Rj,°Ri0' 
Q  -  £  e12 


Tha  optimal  T  is  then  simply  the  balancing 
transformation  for  P  and  Q,  which  can  always  be 
found  (8]. 


3.4  Hon-Inflnlteslmal  Perturbations  of  Multi-Mode 
Systems 


Because  of  tha  nondlfferentlable  structure,  the 
maximisation  in  step  3  cannot  be  performed  by 
simple  differentiation.  In  tha  constrained 
problem,  we  start  from  the  realisations  Lj,...,L2 

in  modes  Mj . M2  respectively,  and  solve  for 

tha  transformation  T  such  that  (in  the  notation 
of  the  unconstrained  problem) 
min  UdajMM}):  1-1 . N) 

is  maximised  over  T  c  Gl^iR).  Many  variants  of 
the  problem  can  be  defined.  For  instance,  the 
e, -weighted  average,  rather  than  the  minimum  of 
the  distances  4(T(Li),{M})  may  be  maximised. 

4.  APPLICATION  TO  FILTERING 
4.1  The  Model 

As  discussed  in  tha  Introduction,  we  shall 
assume  that  tha  nonlinear  dynamics  are  satls- 
factorllly  modeled  by  a  piecewise  linear  multi 
model  stocahstic  system,  thus  a  combination  of 
tha  systems  discussed  in  section  3.  Each  flight 
regime  corresponds  with  one  domain,  and  these 
domains  are  smoothly  patched  together.  Moreover, 
in  each  flight  regime,  we  also  assume  a  multi- 
-mode  model  because  of  the  variable  geometry. 
Tha  system  is  assumed  to  be  controlled,  the 
control  being  conditioned  on  tha  observations, 
and  therefore  deterministic.  This  is  super¬ 
imposed  on  tha  stochastic  inputs,  modeling  the 
noise  in  the  system  as  well  as  the  unpredictable 
component  of  the  motion  of  the  craft  to  be 
tracked. 

Tha  general  discrete  model  is 
*k+l  "  A[k}*k  ♦  *[k]dk  +  <*lk)*uk 
Tk  *  c[k]xk  +  D(kJ«k  +  *[k)*vk 


It  is  also  natural  to  look  at  the  extension  where  u  and  v  are  white  noise  processes,  modeling 

of  problem  A:  Given  the  orbits  w'Hl^),  find  the  the  measurement  and  dynamical  uncertainties  (a.g. 
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the  unknown  input*  due  to  the  unpredictable  P  *  E  end  Q  ■  E  Qj 

action  of  the  creft  to  be  trecked  are  typically 

aodelled  by  colored  noise,  the  noise  shaping  Pinally,  each  of  the  nodes  of  the  filter  is  then 

filter  i*  then  included  in  the  dynanical  aqua-  transformed  to  the  optimal  form.  Ws  have  worked 

tion).  d  is  the  deterministic  input,  which  is  a  out  the  ideas  for  discrete  time  filters.  The 

feedback  of  the  filtered  signal  and  some  oxter-  concept  works  Just  as  well  in  continuous  time 

tially  applied  known  component.  Offsets  (the  [5].  The  conditions  are  the  same  (l.e.  essential 

biases  due  to  an  affine  approximation  of  the  balancedness  of  the  averaged  system), 

nonlinearities)  can  be  modeled  in  these  terms  as 

mmII.  4.4  Subopt  1ms 1  Implementations 


4.2  The  Steady  State  Filter 

Under  the  above  assumptions,  a  steady  state 
Kalman  filter  approximation  is  implemented  in 
each  of  the  domains 

*k+l  “  A(k]*k  +  B(k]<1k  + 

K[kJ<yk-c[k]*k-DIk]*k) 

where  the  gains  are  computed  for  the  steady 

state.  This  of  course  requires  some  assumptions 
on  the  deterministic  signals  dk  and  *k.  Typical¬ 
ly  such  a  filter  is  used  in  a  feedback  scheme  in 
order  to  provide  the  control  command,  i.a.  wa 
also  have  an  "output"  equation 

dk  -  rk  -  M[kjxk 

which  generates  the  control  command.  The 

combined  equations  are  then 

ik+i  -  <A(k]-BfkiM{k}-Kfk]Crkl)ik  + 

B[k]rk  *  *[k]“[k]*k  +  *[k]yk 

i.a,  a  multi -mode  system 

*k+l  ”  ^Ik^k  +  c(klwk 
dk  *  H[k]*k  +  rk 
with  the  modes  defined  by 


*i  *  Ai  ’  BlMi  ’  Kici 
Gj  -  C  Bt  ;  -KiDi  5  Kt  ] 

Ht  -  -Ml 

and  the  input  wk  is  lrk' ,ek' ,yk' J 1 • 


4.3  The 


pigmentation 


The  equations  for  the  filter  modes  obtained 
in  the  previous  subsection  are  of  the  form  of  the 
multi-mode  systems  in  section  3.  The  results 
obtained  there  apply  therefore  directly.  In 
particular,  the  sojourn-times  {s^}  can  be 
estimated,  either  via  simulation  on  the  exact 
dynamical  (nonlinear)  system,  or  in  the  simpler 
cases,  by  direct  analysis.  The  Gramlans 
Qi  "  Oi'Oi  and  P*  "  8^'  can  be  computed  by 
solving  the  Lyapunov  equations 


Wl  +  CiG'i  “Pi 


F'iQiFi  +  H'iHi  •  Qi 


The  transformation  to  the  minimal  sensitivity 
coordinate  basis  is  then  obtained  by  balancing 
(7]  the  matrices 


While  the  optimal  implementation  is  based  on 
the  steady-state  filters  described  in  section 
4.2,  the  applicability  of  these  filters  is  not 
appropriate  for  the  stochastic  transition  case 
when  the  transitions  are  not  known  to  the 
observer.  The  reasons  for  this  is  that  the 
filters  were  derived  for  the  steady-state  case, 
which  assumes  long  sojourn  times  for  such  a 
steady-state  to  be  achieved.  The  results  elso 
assume  that  the  mode  of  the  system  may  be  known 
so  that  the  appropriate  filter  can  be  selected. 
Such  an  assumption  can  be  Justified  for  long 
sojourn  times  that  allow  mode  identification. 
Finally,  when  fast  transitions  among  the  modes 
can  occur,  a  steady-state  will  also  never  be 
achieved  for  any  of  these  states. 

In  this  case  a  suboptlmal  choice  of  the 
filters  is  considered  based  on  the  slxe  of  A 
small  parameter  d  is  selected,  and  the  modes  are 
classified  into  two  typesi  those  with  x1  >  6 
(slow  modes)  and  those  with  ij  <  4  (fast  modes). 
A  set  of  filters  as  shown  In  section  4.2  is 
designed  to  run  in  parallel  for  all  slow  modes, 
.and  the  correct  one  is  chosen  based  on  a  likeli¬ 
hood  function  that  is  based  on  these  steady-state 
filters  (the  transitions  are  ignored  to  avoid  the 
exponential  rise  in  complexity).  Two  alterna¬ 
tives  are  considered  for  the  fast  mode  filters. 
The  first  is  to  define  an  average  model  just 
based  on  these  modes  and  their  sojourn  times  Sj, 
and  us*  this  to  obtain  a  filter  for  these  modes 
with  corresponding  aggregated  likelihood  func¬ 
tion.  The  second  is  to  use  a  separate  filter  for 
each  mode  and  use  a  weighted  average  using  the 
lllellhood  functions  of  each  mode. 

In  these  cases  the  filters  will  be  as  given 
in  section  4.2  and  the  likelihood  function  is 
obtained  by  standard  expressions,  which  may  be 
modified  as  in  [2]  whan  the  multi-models  result 
from  nonlinear  approximations.  In  that  case  the 
regions  in  the  state  space  can  be  approximately 
found  f rod  the  estimates  and  combined  with  the 
statistical  expressions. 


4.5  Examplet  Two-Dimensional  Intercept  Problem 


In  this  t vo -donum* Iona  1  example  the  states 
represent  the  relative  positions  of  the  missile 
and  the  target,  which  may  be  given  as 


F«Vy 

V,  -  di  - 
Vy  -  d2  -  «2 
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where  x  and  y  ara  the  relative  positions  in  the 
(x,y)  plane,  ii  ara  the  control  forces  of  the 
missile  which  will  be  based  on  the  estimates  of 
the  states  assuming  separation  holds,  end  a^  ara 
the  maneuvering  acceleration  of  the  target  which 
may  be  modeled  by  a  first-order  Markov  process 

•i  “  ♦  «i.  1  -  1.2. 

The  objective  may  be  formulated  as  a  quadratic 
control  problem  except  that  the  observation  ara 
nonlinear  in  the  states,  namely 

“  (x2  +  y2)^  +  vj  ■  r  +  V| 

*2  m  tan"1  (y/x)  +  v2  “  ®  +  v2 

where  v  is  modelled  as  a  white  noise  with 
covariance  dependent  on  the  relative  distance. 

The  model  may  be  approximated  In  two  possible 
ways.  The  first  Is  to  define  new  states  Involv¬ 
ing  the  angle  and  the  range  and  these  will  lead 
to  a  nonlinear  model  for  the  state  equations. 
This  model  is  then  approximated  by  a  piecewise 
affine  multl-siode  set  of  equations.  The  second 
is  to  approximate  the  observation  functions  by 
piecewise  affine  multi-mode  system  with  linear 
state  model.  If  we  define  the  set  points  for  the 
approximations  as  r^  and  0^  than  the  observations 
will  be  given  approximately  by 

rj  ■  r^  +  cos  0A  (x-xj)  +  sin  0*  (y-y^)  ♦  wj 

*2  “  ei  +  (l/*i)  (c0*  *i  " 

sin  0A  (x-xA))  +  v2. 

The  multi -mode  filter  is  then  derived  using 
the  expressions  of  section  4.2.  The  control 
command  may  also  be  incorporated  in  the  design 
using  either  given  control  strategy  such  as 
proportional  navigation  or  suboptlmal  implementa¬ 
tion  of  an  optimal  quadratic  cost  state  feedback 
control  law.  The  result  can  be  evaluated  using 
simulation  of  the  system  under  several  engagement 
scenarios. 

3.  CONCLUSIONS 

The  optimal  sensitivity  properties  for  the 
uultl-mode  realisations  have  been  derived.  They 
extend  nicely  the  notions  of  Essentially  Balanced 
Realisations  derived  in  [4-3].  These  optimal 
realisations  have  been  applied  to  obtain  an 
optimal  implementation  of  a  simple  multi-mode 
filter,  which  allows  the  tracking  of  a  target 
with  low-complexity,  small  wordlangth  hardware. 
This  simple  multi-mode  model  can  be  justified  if 
the  sampling  rate  is  sufficiently  high.  More 
qusntitative  results  are  presently  under  inves¬ 
tigation  based  on  a  simple  two-dimensional 
tracking  example. 

We  have  restricted  our  discussion  to  square 
systems  (m*p)  and  minimal  realisations.  Exten¬ 
sions  of  the  theory  are  in  progress.  It  seams 
intuitively  clear  that  one  could  further  exploit 
the  redundancy  of  a  realisation  by  deliberately 
using  nonminimal  realizations.  A  heuristic 
argument  for  this  possibility  is  as  follows:  Let 
M  ■  ►n,  and  let  (A,B,C)  be  a  minimal  sensitivity 


minimal  realisation  of  a  system  H.  Construct  now 
k  different  realisations 

(Ai,B1,C1)  -  {T1ATj"1,TlB,CT1_1;  i-1 . k}. 

With  these  realisations  construct  the  nonminimal 
diagonal  realisation  of  order  N 

k  ■  diag  (Aa)  {  i  ■  vec(Bi)  j  C  ■  vecCCp/k 

If  the  T^'s  are  chosen  in  a  neighborhood  of  the 
identity,  such  that  the  rounding  errors  In  each 
component  system  are  Independent,  then  as  k  -— »  - 

I6ykl2  S  IACt  +  CtiJn  l2..,/k  |x<k>l2  — —  0 

It  is  possible  to  overparametelse  the  system  in 
order  to  obtain  minimal  sensitivity  realisations. 
Finally,  the  idea  in  the  proof  of  the  main 
sensitivity  theorem  leads  to  gradient  type 
algorithms  for  the  optimal  sensitivity  realisa¬ 
tions.  This  of  course  is  to  be  performed  off 
line,  during  the  design  stage,  and  poses  there¬ 
fore  no  restrictions  on  the  hardware.  Prelimi¬ 
nary  remarks  regarding  these  appear  in  [5]. 
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ABSTRACT 

This  paper  addresses  some  aspects  of  the  design  and 
implementation  of  multi-mode  systems,  under  finite 
precision  restrictions.  This  occurs,  for  instance, 
when  quantization  and  finite  wordlength  effects  need  to 
be  incorporated,  or  when  high  component  tolerances  in 
analog  designs  need  to  be  considered.  The  freedom  m 
the  design  is  exploited  in  order  to  obtain  the  reali¬ 
zations  closest  to  the  normal  or  desired  behavior, 
despite  Che  interference  of  quantization,  component 
tolerances  (analog  case)  and  finite  wordlength 
(discrete  case).  Our  interest  is  in  Che  mathematical 
characterization  of  this  new  type  of  robustness 
problem,  and  its  solutions  under  various  criteria  of 
optimality.  Earlier  results,  linking  these  optimal 
realizations  for  linear  time-invariant  systems  to  the 
Balanced  Realizations  are  here  extended  to  Che  multi¬ 
mode  and  general  time-varying  systems.  The  feasibility 
of  this  approach  in  multi-mode  stochastic  problems  is 
shown. 


1.  INTRODUCTION 

This  work  builds  on  our  previous  work  on  robust 
design  problems  for  single-mode  time-invariant  systems 
[4,5].  Consider  a  linear  time  invariant  system  (A,B,C) 
with  m  inputs,  p  outputs,  and  McMillan  degree  n.  This 
may  be  a  model  of  a  system  to  a  faithfully  simulated, 
the  implementation  of  an  analog  or  discrete  filter,  or 
an  observer/concrol ler  implementing  an  optimal  regu¬ 
lator  for  a  given  plant.  As  in  all  these  applications, 
not  the  actual  state  coordinates,  but  the  input-output 
relation  is  important,  they  are  usually  implemented  by 
a  so-called  canonical  form.  Reason  for  this  is  that 
these  implementations  minimize  the  number  of  param¬ 
eters,  and  allow  a  pipelined  realization  of  Che 
devices,  e.g.  the  "Direct  Form"  realizations  in  digital 
signal  processing.  A  minimal  number  of  parameters 
corresponds  to  minimal  complexity,  an  important  quality 
if  operation  count  becomes  important.  However,  a 
minimal  set  of  parameters  has  no  redundancy,  and 
therefore,  one  may  expect  high  senaicivity  with  respect 
to  these  parameters.  It  it  clear  that  the  freedom  of 
coordinate  bms  of  the  implementation  should  be 
utilized  to  <etermine  optimal  realizations  under 
various  criteria  for  optimality.  In  particular,  two 
issuea  seem  to  be  important:  sensitivity  and 
clustering.  The  sensitivity  requirement  guarantees 
robustness  of  the  actual  implementation,  while 
clustering  deals  wich  the  parameter  ranges.  It  relates 
to  Che  problem  of  approximately  implementing  a  certain 
system  vith  parameter  values  chosen  from  a  finite  act. 

These  results  will  be  extended  here  to  multi-mode 
systems.  These  systems  have  recently  become  of 
interest  as  models  for  multi-race  systems  [10], 
nonumformly  sampled  continuous  systems  and  as  approxi¬ 
mations  to  nonlinear  systems  (2).  Several  interesting 
aspects  (e.g.  reachability)  have  been  scudied  for  these 


systems,  and  connections  with  other  fields  of  study 
(Yang-Mills  Theory)  have  recently  been  made  [9].  The 
next  section  poses  the  main  design  problems  in  the 
geometric  framework.  Emphasis  is  here  given  to  Che 
noninfinitesimal  perturbations,  the  results  on 
infinitesimal  perturbations  being  presented  earlier 
[4,5],  The  interest  is,  of  course,  to  problems 

involving  quantization  and  finite  wordlength  effects 
in  digital  data  processing,  where  a  random  variable 
approach  to  Che  problem  may  lead  to  unsuccessful 
modeling  of  its  behavior,  as  for  instance  illustrated 
in  [11].  Section  3  then  goes  on  with  the  application 
of  Che  geometric  theory  to  multi-mode  systems.  Two 
types  are  discussed  in  detail:  the  switched  systems 

and  the  piecewise  linear  systems.  The  general  time- 
varying  case  is  presented  in  Section  4.  In  Section  5, 
implementations  of  approximate  filters  are  discussed, 
based  on  a  piecewise  linear  approximation  of  the 
nonlinearity,  and  their  optimal  implementation  as 
discussed  in  Section  3. 


2. 


A  CEOKETRIC  APPROACH  TO  OPTIMAL  IMPLEMENTATIONS : 
NON INFINITESIMAL  THEORY 


The  approach  taken  in  [4]  is  geometric.  The 
realization  space  n  _  is  modeled  as  an  i,.n+m+p) 
dimensional  affine  ^pa’de  with  an  Euclidean  metric 
defined  in  Che  tangent  space  at  each  point.  The 
Extremal  Sensitivity  Theorem  [4]  asserts  chat  the 

minimum  sensitivity  points  of  an  observable  (a  smooch 
map  from  I.  n  to  R)  are  the  points  where  the 

(general ized;’  gradient  is  in  the  eigenspace  of  the 

(generalized)  Hessian.  In  the  case  of  fixed  pome 
implementations,  the  uniform  Euclidean  metric  is 
appropriate,  and  the  gradient  and  Hessian  correspond 

vith  the  usual  notions  in  calculus.  All  results  are, 
therefore,  also  "infinitesimal."  One  can  reasonably 
argue  that  in  finite  wordlength  arithmetic,  the  nocions 
of  infinitesimal  perturbations  of  the  coefficients  are 
meaningless.  This  may  indeed  invalidate  the 

application  to  finite  wordlength  effects.  For  this 
reason,  shall  develop  the  analysis  for  noninfini- 
tesimal  perturbations  in  this  paper.  It  will  be  shown 
that  this  spproach  makes  a  connection  with  the  notion 
of  clustering.  In  this  paper,  by  finite  we  shall  mean 
noninfinitesimal.  By  L  ve  denote  a  particular  realiza¬ 
tion  in  and  by  M  the  equivalence  class  of  all 

similar  realizations  having  particular  inpuc-output 
behavior,  i.e.  the  orbit  i  (M)  under  CLn(R).  The 
equivalence  class  vill  be  referred  to  as  "system,"  the 
orbit  space  is  denoced  by  MB  n  and  the  projection 
map  from  -  -  to  M^  n  «  by  "*  study  two  problems, 
which  are  r^lifed  to  ttt  sensitivity  and  robustness. 


Problem  A:  Discrimination.  How  do  we  choose  a 

realization  of  a  given  system  MQ  _juch  that  it  is 
maximally  distant  from  the  orbit  e  (M) ,  where  M  is 
another  given  system? 

Problem  1:  Worst  Case  Defecte.  If  the  system 

parameters  are  perturbed  over  a  fixed  noninfinitesimal 
amount,  and  if  f  is  a  scalar  system  function  (i.e. 


mvananc  under  similarity) ,  Chen  find  Che  realizations 
L  of  M0  for  which  Che  f-perturbacion 

max  {f(L+A)  -  f  ( L ) }  where  •  { A  [  Id  I  «  l} 

CA 

is  ninimsl . 


Theorem  2.  The  realizacion  of  M0  which  has  maximal 
Eiting  di'tfance  Co  Che  orbic  of  Mj  is  implicitly  given 
by  l*  t  i  (MQ),  aatisfying 

na'.lji  -  o 


In  boch  problems,  Che  norm  funccion  in  Che 
realizacion  space  will  be  fixed  co  be  che  Eising  norm 
(corapacible  wich  che  uniform  mecnc). 

ILI 2  -  I (A, B , C ) I  2  -  Tr{ AA1  ♦  BB ’  ♦  C'C} 

Ic  follows  Chen  chac  if  (A,B,C)  is  a  solucion  Co  each 
of  che  above  problems,  chen  also  any  realizacion 
obcained  from  (A,B,C)  by  an  orchogonal  similancy 
cransformacion  is  also  a  solucion.  Before  outlining 
che  solucion  co  che  above  problems,  we  shall  incroduce 
a  chird  one,  co  which  problem  A  is  relaced: 

Problem  C:  Clustering.  Find  che  realizscions  L  of  M 
wich  minimal  syscem  (Eising)  norm. 


This  has  che  physical  significance  chac  cooper- 
acively  che  componencs  of  che  realizacion  are  as  small 
as  possible,  hence  clustered  near  zero.  Ic  is  a 
special  case  of  the  following  more  general  (and  more 
significant)  clustering  problem.  Lee  T  ■  {y  j ,  . . .  ,y  J  be 
a  finite  subsec  of  R,  Chen  we  formulate: 


Problem  C':  T -Clustering.  Find  che  realizations  L  of 
N  wich  componenc  values  closest  (in  Che  Eising  norm 
induced  mecnc)  to  che  sec  T  • 

1  M 


Problem  C  corresponds  then  to  T  •  {o}.  It  is  also 
helpful  to  define  a  bilinear  map  on  1*bi  n  p: 

If.,.)]  :  L  xL  — *  Rn*" 

m,n,p  m,n,p 

[ [ (A, B.C) , (F,C,H) ) ]  *  [A 1 , F ]  -  CB'  ♦  C'H 


where  [.,.]  is  Che  usual  Lie  product:  [A,F]  •  AF  -  FA. 


A  optimally  clustered 

.  +  V  ★ 

where  d.is  the  perturbation,  d_[L  ,L*J  *  IL  -L*l  ,  with 
L  t  t_1(Hj).  The  maximal  distance  is  Chen  che  Eising 
norm  of  d . 

Proof.  This  follows  easily  by  solving  che  minimax 
problem:  First  determine  for  a  gijjen  realization  L  of 

Mg,  Che  point  L*  on  the  orbic  t  (M^ )  for  which  che 
Eising  discance  dg(L,L’)  is  minimal.  Thi^  problem  has 
always  a  solucion,  buC  che  realizacion  L  may  not  be 
unique.  The  proof  is  similar  as  ^in  Che  clustering 
Cheorem.  The  condicion  is  [[L,L-L  ]  J  *  0.  Nexc  we 
slide  L  on  ics  orbic,  Che  associated  realization  L" 
will  clearly  vary  with  L,  so  Chat  we  may  define  a  map 

»-1(M0)  — >  s_1(Mj)  :  L — •>  L#.  Now  find*  the 
similar  realizacion  L*1<k  for  which  d£(L*,L  )  is 
maximal.  The  realizacion  L  is  che  corresponding  point 
(L*)  .  This  conscrained  optimization  problem  yields 
Che  additional  condicion  [ { L— L  ,L-L  H  0.  • 

There  remain  some  open  problems.  Ic  is  not  clear 
wheCher  or  noc  Che  orbits  can  diverge,  in  Che  sense 
chac  che  optimum  may  be  on  the  closure  of  che  orbics, 
and  cherefore  noc  actainable.  Also,  if  a  solucion 
exiscs,  it  m^y  noc  be  unique  (modulo  0(n)).  The 
example  of  s  (0,1,1)  and  x  (1,1,1)  illustrates  chac 
to  every  L  in  the  first,  a  corresponding  L*  on  the 
other  orbic  exists  for  which  the  distances  are  constant 
and  equal  to  1.  It  is  also  natural  co  look  at  Che 
«xcen»ion  of  problem  A. 

Problem  Multi-Mode  Discriminaticyi.  Given  che 

orbics  s  (L),  find*the  realizations  L  on  each  orbic 
such  that  Che  sec  {L^}  is  maximally  sepaVaced. 


Theorem  1.  The  class  of  optimality  clustered  realiza¬ 
tions  coincides  with  the  class  for  which  [[L,L]J 
vanishes . 

Proof.  If  (A,B,C)  is  constrained  co  realize  M,  then 
if  (A0,B0,C0)  is  a  representative  for  M,  we  get  a 
conscrained  optimization  problem,  for  which  the 
Hamilconian  is 

H  •  TrfAA*  ♦  BB'  ♦  C'C  ♦  {tAq-AT}  ' 

♦  ab1tV8}’  *  ac1c0-ctH 

The  opcimalicy  conditions  lead  then  directly  to  the 
stated  result.  • 

Noc  every  syscem  allows  an  optimally  clustered 
realization.  A  counter-example  is  Che  system 

A  *  [i  ?]  » ■  [1]  c  -  n  i) 

Converging  sequences  of  transformations  can  be  found 
which  yield  equivalent  syscems  wich  decreasing  Eising 
norm,  but  che  limit  realizacion  is  noc  a  point  on  che 
orbit  of  the  given  realizacion.  It  it  a  known 
phenomenon  chat  Che  orbics  under  similarity  are  noc 
closed  (6j. 

■•mark.  A  realizacion  is  optimally  clustered  iff  there 
exists  an  orchogonal  cransformacion  S  c  0(n*m),  such 
that  (A,B]S  •  (A'.C'l. 


This  is  of  interest  for  realizing  multi-mode 
systems.  The  practical  significance  of  this  all  is 
that  the  realizations  with  the  largetc  incraset 
discance  are  the  most  robust  with  respect  to  parameter 
inaccuracy,  at  for  instance  due  to  coefficient 
truncation.  The  problem  will  be  discussed  in 
Section  3.4. 

As  Co  Problem  B,  we  shall  just  tcace  che  following 
results  for  the  scalar  observable  f:  1^  — >  I, 

which  in  fact  only  gives  an  implicit  ablution  to 
Problem  B; 

Theorem  3. 

(1)  Let  Che  realization  L  be  given.  The  deviation 
f(L*A)  -  f(L),  is  extremal  if  Che  perturbacion  A 
it  in  che  direction  of  the  gradient  of  the  observ¬ 
able  f,  evaluated  at  L+A . 

(2)  If.  only  the  system  M  is  given,  i.e.  che  orbit 

»  1(M),  then  the  deviation  f(L*A )  -  f(L)  is 

extremal  at  L*  if  che  perturbation  A*, 

grad  f(L**A*)  and  grad  (L*)  are  all  aligned. 

Proof.  Again  this  follows  simply  from  adjoining  che 
constraints  IAIZ  ■  1  and  for  part  (2)  also  f(L)  ■  cat, 
with  Lagrange  multipliers  A  (and  u  for  (2))  Co  the 
performance  funccion  f(L*A)  -  f(L).  The  optimality 
condicions  obcained  by  nulling  the  parcials,  with 
respect  Co  A  and  L,  are 


With  this  solution,  we  can  show  the  following: 


grad  f(9+A)  ♦  AA  *  0 

grad  f(9*A)  ♦  (u-1)  grad  f(9)  *  0 


• 

Remark.  The  infinitesimal  reault  of  [4]  is  recovered 
for  the  uniform  metric  if  the  perturbation  become* 
lnfiniteaimal 1  sail. 

3.  MULTI-MODE  SYSTEMS 

A  multi-mode  aystera  i*  in  effect  a  time-variant 
syatem.  However,  the  term  will  be  uaed  to  deaignate 
the  particular  caae  where  the  time  apent  by  the  ayatem 
in  each  mode  ia  significantly  longer  than  the  dynamical 
characteriat ic  times  (e.g.  time  conatant*  and  oacilla- 
tion  period*),  in  each  mode.  Heuriadcally  speaking,  a 
multi-mode  ayatem  behave*  locally  (in  a  temporal  aenae) 
like  a  time-invariant  ayatem.  Each  "mode"  of  a  ayatem 
I  will  be  denoted  by  a  triple  (Al,B1,Cl)  of  nxn,  nxm, 
and  pxn  matncea,  respectively.  We  shall  also  assume 
that  the  number  of  different  modes  is  finite,  N  say. 
(Although  the  theoretical  development  remain*  valid 
with  a  countable  set  of  modes,  its  justification  in 
finite  time  data  processing  is  elusive.) 

This  is  in  contrast  to  fast  switching  (Section  4). 
In  this  case  it  is  well  known  chat,  for  instance,  Che 
stability  properties  are  not  directly  determined  by 
the  individual  dynamical  interests  At .  Two  types  of 
models,  with  many  similarities  are  discussed:  switched 
systems  and  piecewise  linear  systems. 

3. 1  Switched  Systems 

The  system  is  assumed  to  be  modeled  by 

Vi  ■  A[k]\  +  B[k]\ 


consecutive  samples  of  the  pulse  response  of  the  multi- 
mode  system,  given  Chat  the  input  pulse  occurred  while 
the  system  was  in  mode  i.  If  this  initial  pulse  time 
is  uniformly  distributed  ir  the  interval  (0,Tl),  where 
Tj  is  the  duty  time  of  mode  i,  then  the  two  Hankel 
matrices  will  be  equal  with  "probability"  T  -2k*l/Tx 
as  long  as  2k  is  less  than  T  .  Note  chac  the  above 
probability  converge*  Co  1  if  k/TI  decreases.  In  this 
case,  a  weighted  average  of  Che  observables  corre¬ 
sponding  to  each  mode  is  justified,  with  the  weights 
proportional  to  the  ducy  cycles,  or  expected  sejourn 
times  of  the  system  in  the  respective  modes.  This 
yields  then  at  once  Che  extension  of  the  sensitivity 
theorem  in  (4): 

Theorem  4.  Defining  the  observable  for  the  mulct-mode 
system  a*  a  weighted  average  of  the  observables  in  the 
single  modes,  the  optimal  unconstrained  realization  is 
obtained  by  realizing  each  mode  individually  in 
essentially  balanced  form. 

Proof.  The  observable  is 

f  -  Tr  I  i.A.(H-OR) 

it  l  it 
l 

which  is  a  weighted  sum  of  the  observables  defined  in 
(4,5)  for  single-mode  systems.  The  parameterization  is 
with  respect  to  the  components  of  the  observability 
matrices  0.,  and  Che  reachability  matrices  IT.  The 
gradients  are  linear  in  these  parameters,  and  the 
Hessian  eigenproblem,  therefore,  decouples  into  the 
individual  components,  giving  the  simple  condition 


expressing  essential  balancedness  of  all  mode 
realization*.  • 


where  [k]  c  {i,...,n}  is  che  function  determining  the 
mode  switched  on  at  time  k.  We  assume  Chat  this 
switching  is  state-independent,  but  otherwise,  can  be  a 
purely  deterministic  sequence,  or  a  random  time  series. 
A  theory  for  deterministic  discrete  time  periodic 
systems  was  developed  in  [7].  Here  we  allow  general 
time  variation,  thus  not  necessarily  periodically 
switching  sequences.  In  che  randomly  switched  case,  we 
assume  chac  the  statistics  are  stationary  and  known. 
The  domain  of  validicy  of  each  mode  is  all  of  Rn,  i.e. 
the  whole  state  space.  This  is  in  contrast  with  che 
next  class. 

3.2  Piecewise  Linear  Syatem* 


There  is  one  problem  with  the  above  approach.  By 
individually  optimizing  each  mode,  there  will  be  no 
c  o  ramon  base  for  the  state  spaces  in  each  mode.  This 
means  that  if  at  time  T  a  mode  switching  occurs  from  i 
to  j,  the  state  existing  at  time  T  in  mode  i,  , 
cannot  be  directly  used  as  "initial  condition"  (ac 
time  T)  for  the  system  in  mod*  j,  but  need*  to  be 
transformed  first  Co  che  proper  coordinates.  If  all 
mode*  communicate,  (i.e.  if  all  mode  transition*  are 
present  or  possible),  it  means  chat  N(N-l)  transfor¬ 
mation  matrices  need  to  be  scored.  This  sec  consists 
of  pairs  of  mutual  inverse*.  If  only  cyclic  shifts 
occur,  N  transformations  suffice.  Besides  this 

required  overhead  in  memory,  Che  additional  computa¬ 
tions  induce  also  inaccuracies,  and  may  therefore  upset 
the  optimality  for  Chat  scheme. 


This  is  a  multi-mode  system  as  described  above, 
but  che  domains  for  the  validity  of  each  mode  partition 
the  state  space,  i.e.  they  form  a  "patchwork"  which 
piece*  together  a  single  "global"  syatem.  Clearly,  one 
can  Chink  of  such  a  system  as  a  switched  system  with 
Che  switching  completely  determined  by  che  state  x^  of 
the  system.  Such  a  model  results,  for  instance,  in  the 
approximation  of  a  nonlinear  system  by  a  piecewise 
linear  one  [1],  Despite  its  local  linearity,  che 
dynamical  behavior  of  such  a  system  can  be  very  complex 
and  sustain  chaotic  motion  Details  are  presented  in 
[3]. 

3.3  to  bust  Implementation 

The  problem  it  to  design  an  optimal  implementation 
of  the  multi-mode  system.  As  a  first  approximation  to 
the  optimal  realization,  each  mode  can  be  realized  in 
minimal  sensitivity  form  as  presented  in  [4],  This  is 
justified  by  Che  following  argument.  Let  be  the  pxp 
block  Hankel  matrix  of  the  Markov  parameters  in  mode  i, 
and  consider  the  Hankel  matrix  formed  by  the  2  k.  — 1 


A  direct  approach  exists  by  solving  the  problem 
with  the  same  optimality  criterion  as  in  Theorem  4 
(i.e.  a  weighted  version  of  the  objective  in  each 
mode),  but  with  the  additional  constraint  that  the 
state  is  communicated  from  on*  mod*  to  the  ocher.  This 
implies  chac  despite  the  fact  that  each  mode  separately 
can  be  realised  in  many  different  ways,  the  total 
system  is  only  left  invariant  under  Che  action  of 
Gln(R),  and  not  Cln(R)N,  where  N  is  the  nui  ber  of 
modes.  The  weights  {»  }  are  again  set  equal  o  our 
assumptions  these  relative  times  are  precomputable. 

Theorem  5.  Let  the  relative  time  spent  in  mode  i 
be  w  ,  then  the  constrained  minimal  sensitivity 
realizations  are  given  by  the  essentially  balanced 
multi-mode  systems  defined  by  the  requirement: 

I  i20'0.  •  I  *ZR  R'  . 

.ill  t  i  i  i 

Proof.  Follows  directly  from  the  EST,  using  the 
observable 

f  •  I  *  TrA  (H  -0  R  ) 

t  till 


with  che  constraints: 


Bin  {4(T(Li)1{M))  ;  i  -  1 . Nj 
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where  R°  and  0°  are,  respectively,  che  reachability 
and  observability  matrices  for  a  nominal  realization, 
and  T  ia  the  GLn(R)  element  to  be  determined,  i.e.  the 
parameterization  for  the  problem.  Firsc,  the  gradienta 
of  the  obaervable  with  reapect  to  Rj  and  CT  are 
computed,  and  the  conacrainta  are  aubaticuted.  Noting 
that  che  cime  *  apent  in  mode  I  .  doea  not  depend  on 
the  real ization  lof  the  mode,  the  gradient  component* 
are  readily  obtained, 


J.  -  *  TR  A 
Oi  ill 

a„  -  s  A  0OT_1 
Rl  111 

Minimization  of  the  gradient  norm  with  reapect  to  T 
yields  then  the  condition 

TPT'  -  T'V’1 


where  we  uaed  the  fact  A  A1  •  A 'A  •  I,  and  defined  the 
generalized  gramians,  weighted  by  the  ae journ-c imea  aa 


I  « 


2  o  o 
R  R 

ill 


The  optimal  T  is  then  simply  the  balancing  transfor¬ 
mation  for  P  and  Q,  which  can  always  be  found  (8).  • 


it  maximized  over  T  t  Cl  (R).  Many  variants  of  the 
problem  can  be  defined,  "For  instance,  che  ^-weighted 
average,  rather  than  the  minimum  of  the  distances 
) , { m} )  may  be  maximized. 

4.  GENERAL  TIXZ-VAJtYINC  SYSTEMS 


Consider  now  the  general  time-varying  case,  with 
realization  {(A. ,B  ,C.)  ;  ic 2} .  The  dynamics  are 
completely  spec1!  f  i*ed  1  by  the  response  matrices  H(k), 
whose  ij-elements  are  the  response  at  time  k+i  to  an 
impulse  at  time  k*j-l 


**i  j  ^  ^k*iAk*i-lAV*i-2‘‘'Ak*j*]\*jBk*j-l 

Just  at  in  [4],  the  optimal  implementation  will  be 
determined  by  the  optimal  factorization  of  H(k)  into  a 
local  (at  time  k)  observability  and  reachability 
matrix,  0(k)  and  R(k),  for  Che  time-varying  system; 


H(k)  »  0(k)R(k) 


k*l 

Ck*2Ak*l 


[Bk,AkBk-l' ••  • 1 


Defining  the  obaervable  fk  as  TrAk(0(k)R(k)-H(k))  ,  che 
only  difference  wi t h  the  development  in  [4]  lies  in  the 
interpretation  (i.e.  it  it  now  the  local  observable  for 
a  time-varying  system).  The  mathematics  carry  through 
in  a  straightforward  manner.  The  extremal  sensitivity 
realization  is  then  again  determined  from  the  encerion 


R(k)R*(k)  -  O' (k)O(k) 


3.4  Noninfinitesimal  Perturbations 
of  thilti-Hode  Systems 


which  means  that  Che  realization  must  be  locally  (at  k) 
essentially  balanced.  This  result  leads  direccly  to: 


It  is  also  natural  to  look  at  _Jhe  extension  of 
Problem  A:  *  Given  che  orbits  *  (M  ) ,  find  „  the 

realizations  L  on  each  orbit  such  thac  trie  sec  (L  )  is 
maximally  separated.  The  practical  significance  of 
this  all  is  thac  che  realizations  with  che  largest 
incraset  distance  are  the  most  robust  with  respect  to 
parameter  inaccuracy,  as  for  instance  due  to  coeffi¬ 
cient  truncation.  We  give  a  constructive  solution  of 
Problem  A'  in  the  unconstrained  case,  i.e.  when  the 
states  do  not  necessarily  have  to  communicate  direccly, 
and  transformations  are  allowable  at  each  mode 
transition: 

(1)  For  each  realization  L  on  MQ,  determine 
realizations  L^  on  the  orbits  of  the  other 
modes  for  which  dg(L,L  )  is  minimal.  Let  the 
minimal  distance  be 

(2)  Determine  A(L,{m})  ■  mmfb  (M.)  ; 

i  "  1,...,n}.  Note  that  this  distance  does 
no  longer  vary  smoothly  as  L  moves  on  M0  (it 
is  not  differentiable  at  che  crossovers). 

(3)  Determine  L*  on  Mq  such  that 

4(L*,(m}>  ■  max  {a(L,{m})  ;  L  realizes  Mq)  . 

(4)  Perform  Steps  1  to  3  for  each  of  th|  modes 
Mt ,  to  find  che  optimal  realizations  L^ . 


Theorem  6.  A  time-varying  system  has  a  realization  of 
minimal  sensitivity  which  is  essentially  balanced  in 
che  cime-varying  sense  [8], 

Proof.  Define  as  Che  system  observable  a  uniformly 
weighted  sum  of  che  local  observables  defined  above, 
i  .e . 

l  N 

f  •  lim  zjj-T  l  TrA  .  ( H( i >-0 ( i )R ( i ) ) 

N—  i— N  1 

The  parameterization  of  the  realizations  is  with 
respect  to  the  local  observability  and  reachability 
matrices.  By  Theorem  4,  the  optimality  conditions  are 

0(i)*0(i)  •  R( l )R ' ( i ) 

i.e.  equality  of  the  local  observability  and 
reachability  gramians.  These  local  gramians  can  be 
simultaneously  diagonalized  by  an  orthogonal 
transformation.  The  corresponding  realization  is  the 
time  variant  analog  of  the  balanced  realization,  and 
its  existence  is  proven  in  (8).  As  the  condition  only 
expresses  equality  and  not  diagonality  of  che  gramians, 
the  extended  notion  of  "essential  balancedness ,"  i.e. 
the  balancedness  up  to  an  orthogonal  (now  time-varying) 
transformation  is  agsin  sufficient.  • 

3.  APPLICATION  TO  NONLINEAR  STOCHASTIC  CONTROL 


Because  of  Che  nondi f ferenc isbie  structure,  the 
maximization  in  Step  3  cannot  be  performed  by  simple 
differentiation.  In  che  constrained  problem,  we  start 
from  the  realizations  L,,...,L2  in  modes  Mj,...,Mj, 
respectively,  and  solve  tor  the  transformation  T  such 
that  (in  the  notation  of  che  unconstrained  problem) 


5.1  The  Model 

As  discussed  in  the  introduction,  we  shall  assume 
Chat  Che  nonlinear  dynamics  are  satisfactorily  modeled 
by  a  piecewise  linear  multi-model  stochastic  syscem 
[2],  thus  a  combination  of  the  systems  discussed  in 
Section  3.  The  syscem  is  assumed  to  be  controlled,  che 


control  being  conditioned  on  the  observations,  end 
therefore  deterministic.  This  is  superimposed  on  the 
•tochestic  inputs,  modeling  the  noise  in  the  system  es 
well  es  the  unpredictable  components  of  the  inputs,  due 
to  coupling  with  unmodeled  dynamics. 

The  general  discrete  model  is 

xk*l  "  A[k)xk  *  B(k)dk  *  Qik]uk 


yk  '  C(k]xk  *  D[k]ek  *  R(k]vk 

where  u  and  v  are  white  noise  processes,  modeling  the 
measurement  and  dynamical  uncertainties  (e.g.  the 
unknown  inputs  due  to  unmodeled  dynamics  are  typically 
modeled  by  colored  noise,  the  noise  shaping  filter  is 
Chen  included  in  the  dynamical  equation).  d  is  the 
deterministic  input,  which  is  a  feedback  of  Che 
filtered  aignal  and  some  externally  applied  known 
component.  Offsets  (the  biases  due  to  an  affine 
approximation  of  the  nonl  meant  les)  can  be  modeled  in 
these  terms,  d  and  e,  as  well. 

5.2  The  Steady  State  Filter 


Under  the  above  assumptions,  a  steady  state  Kalman 
filter  approximation  is  implemented  in  each  of  Che 
domains 


A[k]xk 


*  B[kldk 


<yt-cr 


]xk"D[k]eJ 


where  the  gains  are  compuced  for  the  steady  state. 
This  uf  course  requires  some  assumptions  on  the  deter¬ 
ministic  signals  dk  and  ek.  Typically  such  a  filter  is 
used  in  a  feedback  scheme  in  order  to  provide  Che 
control  command,  i.e.  we  also  have  an  "output"  equation 

dk  ’  rk  -  M(k]xk 

which  generates  the  command  control.  The  combined 
equatins  are  then 


'  (A[kj'B[k)Mlk)"K!k)ClkJ)xk 
+  B[k)rk  *  K[k]D[k]*k  *  KIklyk 


i.e.  a  multi-mode  system 


F  P  P'  ■  C  C  •  P 
ill  11  i 

F'Q  F  ♦  H'H  ■  Q 

li  l  ill 

The  transformation  to  the  minimal  sensitivity 
coordinate  basis  is  then  obtained  by  balancing  [7]  the 
matrices  .  . 

P  •  t  iV  and  Q  «  I 

Finally,  each  of  the  modes  of  the  filter  is  then 
transformed  to  the  optimal  form.  We  have  worked  out 
the  ideas  for  discrete  time  filters.  The  concept  works 
just  as  well  in  continuous  time  [5],  The  conditions 
are  Che  same  (i.e.  essential  balancedness  of  Che 
averaged  system). 

6.  CO  HOLDS  IOBS 

The  optimal  sensitivity  properties  for  the  multi¬ 
mode  realizations  have  been  derived.  They  extend 
nicely  the  notions  of  essentially  balanced  realizations 
derived  in  [4,5].  These  optimal  realizations  have  been 
applied  to  obtain  an  optimal  implementation  of  a  simple 
multi-mode  filter,  which  allows  the  cracking  of  a 
target  with  low  complexity,  small  wordlength  hardware. 
This  simple  multi-mode  model  can  be  justified  if  Che 
sampling  rate  is  sufficiently  high.  Hore  quantitative 
results  are  presently  under  investigation. 

We  have  restricted  our  discussion  to  square 
systems  (m  *  p)  and  minimal  realizations.  Extensions 
are  straightforward.  It  seems  intuitively  clear  that 
one  could  further  exploit  the  redundancy  of  a  realiza¬ 
tion  by  deliberately  using  nonminimal  realizations. 
A  heuristic  argiaaent  for  this  possibility  is  as 
follows:  Let  N  ■  kn,  and  let  (A,B,C)  be  a  minimal 
sensitivity  minimal  rea.ization  of  a  system  H. 
Construct  now  k  different  realizations  [a.,B.,C  } 

-I  —  1  111 

*  IfjAT  , T^  B,CT  ;  i  *  1 , . . .  ,k}  .  With  these  realiza¬ 
tions  construct  the  nonminimal  diagonal  realization  of 
order  N 

A  •  diag  ( A ^ )  ;  B  *  vec  ( B ^ )  ;  C  *  vec  (C^/k 

If  the  T  's  are  chosen  in  a  neighborhood  of  the 
identity,  such  that  che  rounding  errors  in  each 
component  syscem  ere  independent,  then  as  k  — >  » 


*  F(k]xk  *  C(k]“k 


H[k]xk  +  rk 


with  the  modes  defined  by 


H  •  -M 

i  l 

and  the  input  wfc  is  ( .y^ ]  ’ . 

5.3  The  Optimal  Implementation 

The  equations  for  the  filter  modes  obtained  in  the 
previous  subsection  are  of  the  form  of  the  multi-mode 
systems  in  Section  3.  The  results  obtained  there 
•PPly,  therefore,  directly.  In  particular,  the 
se journ-c imes  { » ^ }  can  be  estimated,  either  via 
simulation  on  che  exact  dynamical  (nonlinear)  system, 
or  in  the  simple  cases,  by  direct  analysis.  The 
gramians  ■  O'O  and  Pj  ■  R.R'  can  be  computed  by 
solving  che  Lyapunov  equations  1  1 


I4yj2  <  I4C.  ♦  C.4A.I2  /klx(k)l2  — >  0 
k  ill  max 


It  is  possible  to  overparameterize  the  syscem  in  order 
Co  obtain  minimal  sensitivity  real i zat ions .  Finally, 
the  idea  in  Che  proof  of  che  main  sensitivity  theorem 
leads  to  gradient  type  algorithms  for  the  optimal 
sensitivity  realizations.  This,  of  course,  is  to  be 
performed  off  line,  during  the  design  state,  and  poses, 
therefore,  no  restrictions  on  the  hardware.  Some 
preliminary  remarks  regarding  these  appear  in  [5]. 
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On  Three-Dimensional  Rotations,  Coordinate 
Frames,  and  Canonical  Forms  for  It  All 

ERIK  I.  VERRIEST 

Some  properties  of  the  eigenproblem  for  a  three-dimensional 
rotation  matrix  are  shown,  and  related  to  the  geometrical  rotation 
parameters.  The  problem  of  assigning  a  unique  canonical  coordi¬ 
nate  frame  to  a  set  of  three  mutually  orthogonal  axes  is  consid¬ 
ered.  The  assignment  is  such  that  it  corresponds  to  a  minimal  over¬ 
all  rotation  with  respect  to  the  reference  system.  This  problem  is 
of  interest  for  the  unique  and  consistent  labeling  of  the  principal 
axes  of  various  tensors  related  to  physical  properties  of  materials, 
and  symmetric  matrices  that  appear  in  various  disciplines  of  engi¬ 
neering. 

INTROOU'.  riON 

In  the.aieas  of  celestial  and  applied  mechanics,  robotics,  the  the¬ 
ory  of  elasticity,  radar  and  sonar,  and  in  nuclear,  molecular  and 
solid-sta<e  physics,  one  frequently  needs  to  express  preferential 
spatial  orientations  (attached  to  a  "rigid"  body)  in  terms  of  some 

Manuscript  received  November  19, 1987.  The  research  was  supported  by 
the  U.5.  Air  Force  under  Contract  AFOSR-87-0308 

The  author  is  with  the  School  of  Electrical  Engineering,  Georgia  Institute 
of  Technology,  Atlanta.  GA  30306,  USA. 

IEEE  log  Number  8821052. 


fixed  reference  system  (the  "laboratory"  system).  Coordinate 
transformations  are  also  of  interest  in  expressing  material  param¬ 
eters  such  as  dielectric  tensors,  electrooptic  tensors,  stress  ten¬ 
sors,  and  so  on.  Any  three  fixed  mutually  orthogonal  lines  inter¬ 
secting  in  0  (e.g.,  obtained  by  solving  the  eigenproblem  for  a  real 
symmetric  matrix),  define  48  possible  coordinate  frames  (of  which 
24  are  right-handed).  As  the  labeling  of  the  preferential  axes  is  usu¬ 
ally  arbitrary,  this  paper  addresses  the  problem  of  providing  a 
"nice"  way  to  uniquely  describe  or  represent  a  preferential  coor¬ 
dinate  frame. 

As  our  solution  relies  on  some  elementary  properties  of  3-D  rota¬ 
tions,  some  basic  properties  of  such  transformations  are  first 
recalled. 

Background  and  Notation 

A  half-line  originating  in  0  (the  origin)  will  be  called  an  axis.  If 
u  is  an  axis,  then  the  axis  parallel  to  u  but  extending  in  the  opposite 
direction  will  be  denoted  by  -u.  By  a  (right-handed)  coordinate 
frame  F,  we  understand  an  ordered  triple  of  mutually  orthogonal 
axes,  following  the  right-hand  rule.  A  frame  consisting  of  the  axes 
u,  v,  and  w  in  that  particular  order  will  be  denoted  by  (u,  v,  w).  F 
is  the  set  of  all  possible  right-handed  coordinate  frames. 

There  are  many  ways  to  specify  the  orientation  of  a  coordinate 
frame  relative  to  another  orthogonal  coordinate  frame  with  the 
same  origin.  Denoting  the  axes  of  the  fixed  reference  frame  by  (x, 
y,  i ),  and  of  the  preferential  coordinate  frame  by  (x',  y',  zl,  it  is  stan¬ 
dard  to  represent  the  rotation  by  the  direction  cosines  of  the  primed 
axes  relative  to  the  unprimed  ones.  One  can  think  of  the  new 
(primed)  coordinate  system  as  the  one  resulting  by  operating  on 
the  original  (unprimed)  system  by  some  transformation,  and  it  is 
well  known  that  the  set  of  matrices  0  representing  these  trans¬ 
formations  form  the  rotation  group  SO(3). 

Since  any  rotation  can  be  represented  as  a  global  rotation  over 
t  e  [0,  rj,  measured  counterclockwise  about  some  axis  u,  a  rep¬ 
resentation  of  the  set  of  three-dimensional  rotations  can  be  given 
in  spherical  coordinates:  longitude  b  and  latitude  i  suffice  to  iden¬ 
tify  the  global  rotation  axis  u,  and  the  radius  r  »  6  describes  the 
angle  of  rotation.  However,  SO(3)  is  not  topologically  equivalent 
to  the  open  (or  closed)  ball,  since  antipodal  points  on  the  surface 
of  the  sphere  represent  the  same  rotation.  A  standard  homotopy 
argument  shows  that  the  fundamental  group  contains  two  ele- 
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merits  [3].  The  covering  of  SO(3)  by  5U(2)  leads  to  the  Cayley-Klein 
parameterization  flj. 

The  eigenproblem  for  rotation  matrices  is  summarized  in  the 
following: 

lemma:  i)  A  rotation  matrix  6  has  all  its  eigenvalues  on  the  unit 
circle.  If  the  rotation  is  nontrivial,  only  one  eigenvalue  equals  +1. 
The  global  geometric  rotation  angle  9  satisfies  cos  B  =  (tr(9)  -  1]/ 
2,  and  the  rotation  axis  corresponds  to  the  global  rotation  vector 
(i.e.,  the  eigenvector  u,  corresponding  to  the  eigenvalue  1).  ii)  The 
real  and  imaginary  parts  of  any  complex  eigenvector  correspond¬ 
ing  to  a  nonunity  eigenvalue  have  the  same  norm,  and  together 
with  the  eigenvector  corresponding  to  the  eigenvalue  + 1  they  form 
a  mutually  orthogonal  set. 

Proof:  Part  i)  is  shown  in[1).  As  for  ii),  let  v  be  the  complex  eigen¬ 
vector  of  9,  corresponding  to  the  eigenvalue  X.  Expressing  u'Qv 
in  different  ways  results  in  u'  Re  (v)  *  u'  lm(v)  =0,  unless  the  rota¬ 
tion  is  trivial.  Similarly,  the  simplification  of  v'Qv  leads  to  v'v  *  0, 
which  in  turn  implies  the  orthogonality  of  Re  (v)  and  Im  (v),  and 
equality  of  their  norms. 

it  follows  at  once  that  {Re  (v),  Im  (v)}  is  an  orthogonal  basis  in 
the  rotation  plane.  A  canonical  parameterization  can  be  shown  to 
result. 

Theorem:  Any  rotation  matrix  has  a  (nonunique)  eigenvalue 
decomposition 

0  *=  lu,  v,  v)  diag  (1,  e1*,  e*")  lu,  v,  vj' 

where  i  lies  in  the  interval  [0,  r],  measured  counterclockwise  with 
respect  to  u,  and  such  that  (u,  Re  (v),  Im  <v)l  belongs  to  F. 

Proof:  See  {4]. 

Labeling  Of  Frames  Associated  with  the  Real  Symmetric 
Eicenproblem 

In  many  problems  the  rotation  matrix  6  is  only  of  intermediate 
interest,  in  particular,  consider  the  real  symmetric  eigenvector 
decomposition  A  **  UKU',  also  known  as  the  Principal  Axis  (Com¬ 
ponent)  Decomposition.  K  is  a  diagonal  matrix  and  U  is  orthogonal. 
If  det  U  •  1,  then  (u,,  u,,  uj  is  the  matrix  of  direction  cosines  of 
a  new  right-handed  coordinate  system  whose  coordinate  axes  are 
aligned  with  the  vectors  ti1(  u2,  and  u3.  Clearly,  this  decomposition 
is  not  unique:  eigenvalues  can  be  permuted,  and  to  each  ordering, 
several  different  choices  for  the  corresponding  eigenvectors  may 
exist,  leading  to  different  orthonormal  coordinate  frames.  For  ref¬ 
erence  purposes,  a  canonical  decomposition  is  desirable.  To  facil¬ 
itate  the  search  for  canonical  forms,  the  problem  is  characterized 
in  terms  of  its  invariants,  introducing  the  following  definitions: 

Definition  1:  A  Symmetric  Eigenvalue  Decomposition  (SED)  is  an 
ordered  pair  (V,  K )  where  U  belongs  to  the  set  F,  and  K  is  an  ordered 
triple  of  real  numbers  (i.e.,  an  element  of  If3). 

Definition  2:  Two  SEDs  (L/f,  X,)  and  (U2,  K2)  are  called 

i)  (Weakly)  Equivalent  ( - )  iff  L/,  diag  (K,)U',  *  l/2  diag  {K3)U2  and 
K,  is  an  ordered  triple  of  the  permuted  elements  of  X2. 

ii)  Strongly  Equivalent  (s)  if  they  are  equivalent  and  K ,  -  IC2. 

The  equivalence  classes  induced  by  the  above  equivalences  are 
nontrivial.  In  the  nondegenerate  case,  the  equivalence  class  of  alt 
frames  equivalent  to  a  given  frame  U  is  generated  by  operating  on 
U  by  transformations  of  the  group  Ge,  consisting  of  the  elementary 
operations: 

Begin 


1)  cyclically  relabeling  of  the  coordinate  axes  x',  y‘,  z‘, 

2)  changing  the  directions  of  any  two  axes, 

3)  changing  the  direction  of  one  axis,  and  switching  the  remain¬ 
ing  two. 

It  follows  that  every  orbit  of  this  group  (equivalence  class  FI. )  con¬ 
sists  of  exactly  24  elements.  In  the  restricted  case  of  Strong  Equiv¬ 
alence,  the  frames  can  only  be  related  by  inversion  of  any  two  axes, 
thus  leaving  only  4  elements  in  each  class  of  FIs.  The  matrix  rep¬ 
resentations  of  the  generators  of  Ce  are: 

’o  0  1~|  T-1  0  o~ 

P  =  10  0,5,*  0-10, 

_0  1  0J  [_  0  0  1_ 

‘  Tl  0  O’]  [-1  0  o’ 

Sb  =  0-1  0  ,  Q  «  001. 

0  0  -ij  L  0  1  °. 

The  generators  of  the  subgroup  G„  associated  with  the  Strong 
Equivalence  are  S,  and  Sb.  The  action  of  the  group  element  G  e 
Ce(G „)  on  F  is  defined  by  C(F)  *=  FCe  F:  The  frame  derived  from 
F  by  group  operation  C  is  the  frame  with  associated  matrix  EC.  The 
degenerate  case  is  discussed  in  [4J. 

Canonical  Forms  for  Real  Symmetric  Matrix  Decompositions 

A  selection  of  a  canonical  form  for  the  decomposition  means  that 
to  each  element  [F]  of  the  equivalence  classes  f/.( or  FIs)  a  unique 
representant  is  assigned  [2).  An  obvious  choice  is  the  frame  obtain¬ 
able  by  a  rotation  of  the  reference  frame  over  the  smallest  possible 
angle.  The  ideas  are  made  rigorous  by  introducing  a  "correlation" 
metric  (  -,  •  >:F  x  F—  R  defined  by  <f1(  f2>  «  tr  F,F{.  The  function 
<  -  ,  •  >  is  not  an  inner  product  on  the  set  of  frames  F,  since  the  latter 
has  not  been  endowed  with  a  linear  structure  (i.e.,  addition  of  two 
frames  or  scalar  multiplication  of  a  frame  are  not  defined).  An  inner 
product  interpretation  is  possible  by  embedding  F  in  a  9-dimen¬ 
sional  vector  space  [4].  As  the  reference  frame  is  represented  by 
the  identity  matrix,  one  obtains  the  "correlation":  </,  F>  **  tr  F  = 
(1+2  cos  9).  The  frame  with  the  minimal  9  e  [0,  x]  is  the  one  with 
the  maximal  correlation.  The  map  f  assigning  this  optimal  frame 
to  each  equivalence  class  is  a  complete  invariant  {2],  and  it  follows 
that  the  set  of  frames  (Fc)  =  f(F)  is  a  set  of  canonical  forms  for  F. 

The  selection  algorithm  for  the  canonical  representation  of  a 
given  frame  F  proceeds  then  by  optimizing  tr  F G  over  the  elements 
C  of  the  groups  G,  or  G„  generated  by  P,  Sa,  Sb,  and  Q,  or  Sa  and 
Sb  alone  for  the  canonical  forms,  respectively,  under  Equivalence 
and  Strong  Equivalence  in  the  nondegenerate  case.  The  pseudo¬ 
code  is  provided  in  the  appendix. 

Appendix 

The  Optimal  Frame  Algorithm  for  the  Nondegenerate  Case 

Given  a  right-handed  frame  represented  by  the  vectors  of  the 
direction  cosines  [x„  xfc,  xj,  the  Canonical  Form  under  Equivalence 
is  obtained  via  the  following  algorithm  (for  Strong  Equivalence  omit 
the  loops  in  i  and  /). 


For  / 0  to  2  do 
begin 

[*.,  xb,  xd  :  -  [x„  x„,  xc]P' 

For  /  :  *>  0  to  1  do 
•  begin 

[*„  x„,  xc] :  *  (x„  x„,  xc]Q' 

Find  diagonal  elements  (x,„  xb2,  xc3). 

If  not  all  signs  are  positive, 

then  find  the  two  columns  whose  sign  change  maximizes  the  trace. 
[x„  x„,  xc] :  **  {x„  xb,  xc]  S,  where  S  is  one  of  S„  Sb,  Sc 
F(2i  +  /) :«  [x„  xb,  xc]  {  Store  a  potentially  optimal  frame  } 

T(2i  +/):—  x#1  +  xb2  +  xCJ  {  Store  its  correlation  with  / ) 

end 

end 

:■*  maximum'1  (  Tfjk) ;  /  “  0  to  5  )  {  Search  for  maximum  ) 

Frame  :»  F(k {  Output  the  optimal  frame  ) 

End. 
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An  Improved  Algorithm  for  Low-Pass  to 
Bandpass  Transformations 

STEPHEN  A.  DYER 

An  algorithm  is  presented  for  computing  the  coefficients  of  a 
-ontinuous-time  bandpass  transfer  function,  obtained  by  applying 
e  standard  transformation  to  a  normalized  low-pass  prototype, 
ihe  method  has  all  the  desirable  features  of  a  recently  described 
algorithm  while  achieving  increased  computational  efficiency 
through  use  of  a  recursion  relation. 

I.  Introduction 

In  a  recent  letter  [1],  an  algorithm  was  presented  for  performing 
the  standard  low-pass  (LP)  to  bandpass  (BP)  transformation.  The 
method  is  quite  general,  being  both  independent  of  filter  order 
c..d  applicable  to  prototypes  having  both  poles  and  zeros.  Since 
it  is  algebraic  in  nature,  it  provides  excellent  accuracy  independent 
of  choice  of  scaling  factor. 

The  algorithm  presented  in  the  following  shares  all  the  desirable 
traits  of  that  in  [1],  However,  while  the  method  in  (1]  requires  the 
evaluation  of  a  set  of  binomial  coefficients,  the  present  algorithm 
employs  a  recursion  relation,  resulting  in  decreased  computa¬ 
tional  effort. 

II.  Development  of  the  Algorithm 

We  wish  to  obtain  the  coefficients  of  the  BP  transfer  function  RIs), 
obtained  from  the  normalized  LP  transfer  function  Pis)  by  the  trans¬ 
formation 

RIs)  -  Pts)|,  -  tU>  ♦  aivWll  -  «»  ♦«!-’  0) 

where  «o  is  the  desired  center  frequency,  in  r/s,  of  RIs);  W  is  the 
desired  bandwidth,  in  r/s,  of  RIs);  a  =  1/VV;  and  0  «  uJ/VV. 

The  LP  transfer  function  Pis)  in  (1)  is  assumed  to  have  the  general 
form 

N  v 

2  a„s" 

Pis)  -  - ■  (2) 

2  bmS" 

m  •  0 

The  BP  transfer  function  RIs)  has  the  form 

2  3V 

RIs)  -  -tt* - .  (3) 

2  bmsm 

mm  0 
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The  transformation  (1),  when  applied  to  (2),  yields 

N 

2  a„[as  +  0s ’Ts1 

RIs)  «  — -  (4) 

2  bjas  +  0s ‘TV 

mm  0 

where  y  =  max  IN,  M).  The  factor  sVs’  is  included  to  obtain  a  form 
for  RIs)  which  contains  only  nonnegative  powers  of  s. 

We  concentrate  for  the  moment  on  the  numerator  polynomial 
in  (4),  writing  it  in  the  form  of  a  power  series,  as  in  (3).  We  have, 
then. 


t  N 


2  3,s‘  =  2  a„[as  +  0s' 

*•0  n - 0 

N  9 

'TV 

IS) 

*  2  a,  2  ck.^ 

n m 0  k *0 

r  P  N  I 

k 

(6) 

*  Z  Z  anckn 

k  mQ  [_n  -  0  J 

sk 

(7) 

where,  from  the  right-hand  sides  of  (5)  and  (6),  •>  «=  y  +  N 
the  ik  can  be  found  as 

Thus, 

N 

a*  -  2  a„ck.„  k  •  0,1, 

(8) 

n- 0 


Similarly,  the  bk  of  (3)  can  be  found  as 

M 

5,-2  bmckm  k  -  0, 1,  ■  •  •  ,  n  (9) 

m  ■*  0 

where  p  -  y  +  M.  Actually,  the  range  of  n  and  m  in  (8)  and  (9), 
respectively,  can  be  restricted  further.  This  matter  is  discussed  in 
Section  IV. 

We  need,  however,  to  determine  the  c*,„  before  (8)  and  (9)  can 
be  applied.  From  (5)  and  (6), 

2  ck.„sk  -  (as  +  0s -W  (10) 

*•0 

-  (as  +  fls^KaS  +  0$'V~  V 

-  (OS  +  /3s'1)  Z  Cfc.n-,5* 

k  m  0 

2y 

-  Z  ac*fl.,s*  +  1  +  Z  &ck  n _ \  (11) 

*-0 

Here,  the  upper  limit  on  the  sums  is  set  to  2y  so  that  (10)  can  be 
applied  to  either  (8)  or  (9)  as  needed. 

After  changes  of  variables,  (11)  becomes 

2y  J-r-l 

2  ck  nsk  *  2  ac*_t„_iS*  +  2  0ck  +  in~iSk.  (12) 

*-0  t-1  »--1 

So,  upon  equ.  ,.ng  coefficients  of  like  powers  of  s  in  (12),  we  obtain 
the  recursion  relation 

*  -  1,  •  •  •  ,  2y  -  1 

G*.n  “  ac,_1n_,  +  0c,  (13) 

n  -  1,  •  •  •  ,  7. 

For  n  «  0,  (10)  gives 

0.  k  =  7 

C*0  *  )  (14) 

(^0,  otherwise. 

Also,  (12)  yields 

Co.»  -  dc,.„.v  n  -  1,  •■•,y  (15) 

and 

Cty.n  -  oc2l.,.„.v  n  -  1,  ■•■,y.  (16) 

III.  The  Algorithm 

The  BP  coefficients  3,  and  6,  of  the  RIs)  in  (3)  are  computed  as 
follows: 

1)  Accept  N;  M;  <*<,;  W;  a„,  n  -  0,  •  •  •  .  N;  bm,  m  -  0,  ■  •  •  ,  M. 
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Abstract 

Periodic  discrete  time  systems  are  analysed.  In  particular  we 
investigate  the  Invariants,  Parametruations,  Canonical  Forma, 
and  Realisation  from  input/output  data  for  such  systems.  It  was 
found  that  the  classical  realisation  theory  lor  time  invariant  eye- 
terns  carries  over  very  nicely  to  such  systems.  For  notational 
simplification,  some  results  arc  worked  out  for  the  alternating 
(Le.  period  two)  single  input  single  output  discrete  time  system 
only.  A  novel  definition  for  an  Operational  lYansferfunction  is 
given,  which  is  useful  in  studying  reductions,  realisations  and  in¬ 
terconnections  of  such  systems. 

1.  Introduction 

This  paper  deals  with  periodic  discrete  time  ey stems  of  period 
N.  To  fix  the  ideas,  a  state  space  realisation  of  such  systems  is 
of  the  form 


*k+t  *  ■'%<*)** + 

Vt  ■  Crfk)*k 
p(k)  =  k  mod  S 

The  IV- tuple  {Eo.Ei . E/v-i)  where  £,  is  the  triple  (A,  Bj, 

Ci)  will  refer  to  such  a  realisation.  These  systems  arise  for  in¬ 
stance  by  discretisation  of  periodically,  non-uniformly  sampled 
continuous  time  systems,  and  more  general  periodically  switched 
systems.  In  order  to  simplify  the  ideas,  we  shall  sometimes  look 
at  the  special  case  of  alternating  (i.e.  period  two)  single  input 
singls  output  discrete  time  system.  The  main  ideas  for  the  gen¬ 
eral  case  are  not  different,  but  only  more  complex  in  notation. 

While  tbeee  systems  are  in  many  ways  mors  complex  than  or¬ 
dinary  time-  invariant  systems,  they  have  still  much  more  struc¬ 
ture  than  general  time  varying  discrete  time  systems  analysed  by 
Kamen  [3],  or  even  the  multi-mod*  systems  described  by  Stanford 
et  al.  [2j,  and  Helmke  [1],  and  one  can  develop  a  parametrisation 
theory  for  these  systems  which  is  in  does  analogy  to  the  known 
geometric  theory  for  stationary  systems  (Haxewinks!  (4j). 

In  particular,  the  input/output  behavior  of  such  systems  is 
left  invariant  by  the  traneformation  group 

C£.(R)  x...x  GLn(Jt)  (Ncopiae) , 

and  the  orbit  space  of  the  controllable  system s  is  •  manifold 
which  can  be  decompoeed  into  generalised  Kronecker  calls  which 
form  a  cellular  patch  complex.  The  canonical  forms  act  as  local 
coordinate  systems. 

Our  next  main  result  involves  the  realisation  of  such  a  system 
from  the  knowledge  of  the  impulse  response  sequences 

^  j  a  0,lp.. 

3.  I/O  Equivalent  Time- Invariant  Representations 
for  period-N  Systems 

Soma  preliminary  definitions  and  notations  will  be  given  fat 
this  section.  Also,  the  observability,  reachability  and  stability 
properties  will  be  discerned.  The  properties  and  rsprsesntstions 
are  the  key  to  the  realisation  given  in  section  4.  We  shall  discuss 
the  general  case  for  ^-periodic  systems  in  this  section. 


Given  the  ^-periodic  system  {Eo,Ei,...,E*_i},  let  the  re¬ 
sponse  of  the  system  to  a  puls*  occurring  at  instant  j  <  S  be 
the  sequence  A,  j  ;  t  >  j.  The  system  response  is  readily  seen  to 
be  (where  (k)  indicates  k  mod  S) 


* tj  *  Cji|Aj<_l]A(j_*|...A{>+ljB[,|  « >  j  (1) 

**  0  else 

Define  the  “Hankel*  Matrices  for  this  Periodic  System  as  the 
matrices  Hy+i  whose  («, l)-element  is  hy+4ay_t+i.  This  matrix 
does  not  have  the  (block)  Henkel  structure  as  in  time  invariant 
system*.  However,  it  still  allows  a  factorisation  in  an  observ¬ 
ability  and  a  reachability  matrix  (as  defined  in  the  time-varying 


Rj>i  *  Oj+iR  (2) 

e.g.  the  o-th  block  entry  in  Oj+\  and  the  h-th  block  of  R,  are 
respectively 

[Rl»  *  AyiAtf-ti  •  •  •  Ay+»_*  1%+,-*)  (3) 

Pi+il.  *  •  •  •  Ab>»l  W 

For  fixed  j  in  X . N,  the  derived  sequence  h»  s=  hy+ij;  k  >  0 

is  also  the  response  to  a  unit  pulse,  of  the  following  augmented 
time  invariant  system  of  order  nN.  (Note  that  £*  E  £o) 


A. 


0  0 

Ai  0 

0  A,  0 


0  0  ...An- i  0 


Cm  ■  [  Gi  Cj . . .  Cn  ] 


(5) 


with  rsad-in  matrix  [0, . . .  0,  By ,  0, . . .  0j'  where  the  nonxero  block 
By  occurs  in  the  {j  +  l)-th  block  position.  Such  a  time  invariant 
representation  of  the  pulse  response  sequence  Hij  ;  i  >  j  will  be 
called  an  Adiabatic  representation.  The  corresponding  Adiabatic 
Hankelmatricw  ks  with  (a,  &)-*l*m*nt  hy+4+»_  y,  will  have  the 
true  Hankel  structure.  The  subscript  *c«*  refers  to  'cyclically 
au (man ted* .  The  above  representation  is  in  general  not  minimal. 
A  minimal  realisation  of  the  adiabatic  Hankel  matrix  Ay  will  be 
denoted  by  (Ay,  Ay,  6y). 

In  order  to  treat  all  Aj’s  at  once,  an  equivalent  composite 
system  (the  Cyclically  Augmented  System)  of  Nn  states,  Nm 
input*  and  p  outputs,  is  dsfined  as  the  realisation  (Am, Bm,Cm) 
where  Am  and  Cm  are  as  fat  (2),  and  defining  e  BM-matrix  a* 


0 

0 


[0  0...  Bn-i 


(«) 
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Lotting  6j[i)  denote  the  Z«*-tren*form  of  the  thifUd  se¬ 
quence  ho+jj  :  k  >  0,  then  the  transfermstrix  of  the  cyclically 
augmented  system  ia  simply 

H„(s)  *  [4>(*),  5i(x), ....  £*-t(*)J  (7) 

The  £/(»)  ere  the  transfermatricss  of  the  ADIABATIC  eye- 
terns,  end  it  follows  from  the  previous  discussion  that  they  ere  re¬ 
alised  in  a  non  minimal  way  by  (A**,  [0|  -  -  -  *  0,  Bj ,  0 0]',C«), 

the  aonsero  element  in  the  B- matrix  occurring  ia  th e  (j  +  l)st 
block  position. 

Xemarki 

1.  Dually,  we  can  also  work  with  an  equivalent  ( n#,m,#p ) 
system,  thus  treating  the  periodic  system  as  an  equivalent 
stationary  #p-output  and  m-  input  system. 

2.  Classical  realisation  theory  for  multivariable  time- invariant 
systems  enables  us  to  find  a  minimal  realisation  [F,G,B) 
tor  the  above  Hank  el  matrix.  This  minimal  realisation  is 
then  the  key  to  the  rest  of  our  development.  In  particular, 
since  the  equivalent  stationary  system  captures  all  of  the  in¬ 
put/output  information  of  the  periodic  one,  to  will  it#  min¬ 
imal  realisation  (F,G,  H).  A  parametrisation  for  the  peri¬ 
odic  eystems  follows  then  directly  from  the  parametrisation 
of  the  multivariable  system  (/,  G,  B).  At  once,  we  see  that 
even  a  scalar  periodic  system  leads  to  multivariable  equiva¬ 
lent  systems.  The  restriction  to  scalar  systems  mentionned 
at  the  onset  is  thus  not  restrictivs,  but  permits  simplsr 
notation  and  examples. 

We  indicate  some  particular  results  which  wQl  be  useful!  in 
the  realisation  problem 

Thtonm  1:  The  minimal  realisations  (Aj,  £,,<?<)  of  the  sdi- 
abatic  Hankelmatrices  Ai  have  the  property  that  det(*l  —  A*) 
divides  det {z* l  -  Ao  - .  •  Aar- 1) 

Proof  (for  N  =  2):  The  Hankelmatrix  6o  >»  obtained  from 
the  pulseresponse  Kfi-  Its  ff-transform  equals 

£o(s)  *  Ch(s*/-AiAo)-,AlBo  +  Ci(*,/-AoAi)-l*Bo 
-  (!Vb(*,)  +  /fi(**)l/det(*»I-AoAl) 

for  some  polynomial  matrices  #o  and  Si.  Clearly  then  the  min¬ 
imal  realisations  of  and  6i  have  the  above  ststed  property. 

In  fact,  it  is  easy  to  show  that  tha  realisations  of  Ho  and  Hi 
must  be  very  closely  related.  Indeed,  by  rewriting  Hi  in  the  form 

'[51s* 

The  first  factors  on  the  left  also  appear  in  the  expansion  of  Ho 

Hence,  if  ere  define  the  foDowing  transfermatrix: 

r.-p.-c.i [*■'-/'*  £] 

then  to  -  toi[BJ,0]'  and  tj  «  toilO.^. 

This  obeervstion  leads  directly  to  the  following  theorem: 
Theater*  i:  There  exists  an  observable  pair  (A,£)  and  ma¬ 
trices  Bo  and  fli  such  that  (A,£q,£),  and  (A, Bi^C)  realise 
respectively  the  adiabatic  transfer  matrices  to  and  t]. 

Proof:  Lst  (A,  B,C)  be  a  (observable  is  sufficient) 


realisation  for  toi,  then  Bo  —  t[Bo,0]',  and  —  B[0,  B'jj’. 

The  importance  of  this  theorem  lies  in  its  use  to  find  the 
realisations  for  an  alternating  system.  Given  the  pulse  response 
sequence*  0  and  K,u  we  can  use  the  realization  algorithm  from 
time  invariant  systems  to  determine  minimal  realisations  of  either 
sequence.  By  the  theorem,  these  realisations  can  be  extended  by 
addition  of  uncontrollable  states  if  necessary,  to  observable  real¬ 
isations  with  the  same  A  and  C  matrix. 

3.  Reachability,  Observability  and  Stability 

Definitions: 

•  The  #-reriodie  system  {Eo,Ei,. ..,£*_}}  is  said  to  be 
uniformly  ^reachable  (reachable  in  p  steps),  iff  every  state 
can  be  reached  in  p  steps,  independently  of  the  starting 
event  (=  initial  time  and  initial  state).  The  system  is  said 
to  be  uniformly  reachable,  iff  there  exists  s  p  >  0,  such  that 
it  is  uniformly  p- reach  able. 

e  The  system  is  said  to  be  uniformly  observable  in  p  steps  iff 
the  initial  state  xy  can  be  uniquely  determined  from  p  con¬ 
secutive  outputs  yy . y/+p-i,  independently  of  the  start¬ 

ing  time  j.  The  system  is  said  to  be  uniformly  observable 
iff  it  is  p-o bee rv able  for  soma  p. 

Tkeortm  S:  The  period-#  system  {Eo,...,E*-i}  is  uni¬ 
formly  reachable  iff  the  reachability  matrices  (3)  have  full  rank 
for  all  j.  The  system  is  uniformly  observable  iff  the  observability 
matrices  (4)  have  full  rank  for  all  j. 

The  proof  ia  easily  aatablished  by  a  standard  argumsnt  [5]. 
Sines  ths  adiabatic  systems  of  at  most  order  nS,  provide  an 
underlying  time-invariant  structure  in  the  problem,  at  moat  n# 
steps  need  to  be  considered  for  checking  uniform  reachability  and 
observability,  by  virtue  of  the  Cayley-  Hamilton  Theorem.  Some 
direct  corollaries  of  the  theorem  are: 

I)  The  Cyclically  Augmented  system  (A*.,  Bc.,Ce.)  is  reachable 
iff  the  period-#  realization  {Ei,E|,...,Ejv}  is  uniformly 
reachable. 

tl)  (Si, Es ...  i E<}i*  uniformly  observable  (reachable) 

iff  {Eo,Ei,...tEjv-i}  is  uniformly  observable  (reachable), 
whence  the  invariance  of  uniform  observability  and  reach¬ 
ability  under  a  cyclic  shift. 

IB)  Using  the  backward  propagation,  we  can  write  the  output 
at  time  •  in  terms  of  the  previous  inputs,  us.,  we  look  at 
hij  for  fixed  i,  and  define  the  equivalent  stationary  systems 
with  the  sbove  A-matrix  and  C  =  (0,...0,Ci,0,...,0),  the 
nonzero  block  occuring  in  the  i-tb  block  position,  and  B  * 
[5q, Bj,..., By_j)'.  We  then  have  the  *du*lity*-property: 
{Ej,...,Ey-i,E*}  is  unif.  observable  iff 
{Ejy,Ejy_t,...Ei}  is  unit  reachable,  where  the  'dual* 
system  is  obtained  by  time  reversal  of  the  sequence  of  the 
duals  Ef  of  ths  realizations  E,,  where  (Ai.R.Ci)4  is  ths 
triple  (A|,CJ,BJ).  We  are  thus  led  to  the  definition: 

{Et,Es,...,Eo}^u**  •  {Eo,E^_|,...,Ei}  (>) 

Finally,  wa  remark  that  if  all  A<  are  aoasingular,  as  for  in¬ 
stance  in  the  important  ease  of  ths  discretization  of  s  continuous 
system,  the  criterion  of  Theorem  1  can  be  simplified  by  virtue  of 
the  following 

imiu.'  V  the  Ay  are  nonaingular  for  all  j,  then  the  foil  rank¬ 
ness  of  on#  at  ths  reschsbility  matrices  K,  (observability  matrices 
0()  implies  ths  foil  rankness  of  all  others,  and  hence  reachability 
(observability). 

Am  an  example,  a  rise  alternating  system  Eo.Ei  will  be  uni¬ 
formly  reachable  iff  the  stationary  systems  (A|Ao,  [ki,AiV>]) 
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(AoAi,  [&o>  Ao&il)  are  teachable.  If  th*  system  if  uniformly  reach¬ 
able,  bo  don  than  2n  step*  an  required  to  reach  any  desired  end- 
state.  If  the  product  AoAi  k  nonaingular,  then  (A1A0,  {*r,  Aiko}) 
and  (AoAi,  (4o,  Ao^i])  are  either  both  reachable  or  both  non- 
reachsbl*.  By  applying  inputs  before  0,  oce  gets  the  reachability 
(elation  at  time  0; 

so  “  Jfi[tlo,*-l,...f 

where  iti  =  [*i,  Ailo.AiAoki.-.-]  *  th*  time  varying  reachability 
matrix  [3].  Observation  of  the  output  sequence  after  time  0,  with 
so  input  applied  leads  then  to  the  observability  relation: 

where  Oo  «  [co,  A(,  A'jC^, . .  •j'  is  the  time  varying  observ¬ 
ability  matrix.  Similarly,  we  construct  the  reachability  and  ob¬ 
servability  matrices,  So  and  Or,  relating  to  the  reference  time  1. 
The  products  OqRi  sad  Oj  So  are  then  the  alternating  (period-2) 
Hankelmatrices  defined  in  (2). 

We  also  have  the  following  important  stability  theorem: 

Tktartm  4:  The  JV-period  system  (0)  is  etable  if  the  eigen¬ 
values  of  the  product  AoAi  . . .  Aff-i  hare  modulus  leas  than  I. 

Proof:  The  convergence  properties  of  the  periodic  systems 
are  determined  by  the  convergence  properties  of  the  equivalent 
time  invariant  system  (X*,,  B.*,Cm).  The  letter  is  completely 
determined  by  the  characteristic  polynomial 
det(**.r  —  Ai Ai A* . . .  A/r)  «  0. 

The  problem  with  this  approach  ia  that  tha  resulting  time- 
invariant  system  has  order  Nn  if  »  ia  the  order  of  the  individual 
realisation*  Hi-  The  original  periodic  system  is  only  of  n-th  or¬ 
der,  so  that  a  "hidden  inodes* -phenomenon  occur*. 

4.  Canonical  Forms,  Parametrliatlon  and  Topological 
Structure 

The  first  object  in  this  study  is  to  find  the  transformation* 
on  the  realizations  that  leave  the  input-output  behavior  (>.*.  all 
adiabatic  transfermatrices  and  the  the  periodic  system  'Banksl* 
matrices  (2))  invariant. 

Let  {Eo,Zi,...,£y_i}  be  a  realization  of  an  S -period  sys¬ 
tem.  Denote  an  element  of  the  group  Gln(S)N ,  denoted  by  Gt* 
tor  abort,  by  The  group  action  is  defined  by 

:  (Ao.Bo, C»),...,(Asr-»,ftv-i,C|f«j)  — • 

lPi^\PlSo.CtPS'l).lP>Atr.tPfii,P,S/f.l,C/f.,P^it)  (9) 

The  states  transform  aa 

*/fk  — 

•JCH-I  - *  Pi*Nh+ 1  (10) 

The  following  property  ia  readily  shown: 

Tktortm  S:  Equivalence  of  State  Space  Representations. 

The  product  group  Gln(R)H  action  on  the  set  of  period-# 
system*  leave*  the  I/O  properties  invariant. 

Ones  tha  symmetry  group,  (Ls.  the  group  whose  action  leaves 
the  I/O  behavior  invariant)  is  established,  we  can  look  at  the 
question  of  canonical  forma:  Any  property  of  the  original  system 
can  be  described  aa  a  map  from  the  set  of  systems  £,  to  some 
suitable  eat  S.  After  introducing  canonical  forms,  the  study  of 
the  original  function  f  is  then  replaced  by  the  study  of  some 
•simpler*  function  /  :  C  —  S,  such  that  /«/•»,  where  x 
is  the  canonical  projection  »  :  £  — *  C  on  the  set  of  canonical 

For  notation*!  simplicity,  the  rest  of  this  section  will  be  re¬ 
stricted  to  alternating  system*.  The  above  development  should 
give  enough  insight  to  realize  that  the  general  principles  remain 
tha  earns.  Canonical  forms  for  the  unifomly  reachable  systems 
are  obtained  by  tbs  usual  Kroo acker  selection  procedure.  La. 


among  the  2n  columns  of  Sc,  select  n  linear  independent  ones, 
which  form  a  basis  fic  lor  the  state  space.  In  particular,  a  unique 
•nice*  selection  may  be  cboeen  according  to  tbs  Young  or  ‘crate*- 
diagram.  Similarly,  let  Pi  be  another  basis,  cboaen  by  a  nice  se- 
lection  among  the  columns  of  Si.  Now  express  the  system  with 
respect  to  the  basis  which  is  alternating  between  Pc  end  }\.  e.g. 
Ac  ia  represented  by  the  a  new  matrix  whoa*  /- th  column  is  the 
representation  in  terms  of  th*  basis  Pi  of  Ao  operating  on  the 
y-th  baaisvector  from  the  other  basis  Pq. 

Th*  affect  is  that  the  new  representation  w  of  the  form  io  = 
hi  *  (1,0,... 0J'.  (By  aaeumption  of  reachability  neither  &q  nor 
hi  are  aero).  If  v<(k)  denote*  the  petition  of  the  k-th  basis  vector 
from  Ri,  then  ere  refer  to  the  sequence*  w,  *  sj(I),...,e<(«)  u 
a  multi-index.  The  k-th  column  of  the  new  A-matrice*  are 

AJe*  *  f*oe.i(k)+i 

Aje*  at  RJe^e^i  (11) 

However  straightforward  the  previous  extension  of  the  known 
scheme  may  be,  a  particular  nice  form  is  obtained  as  follow*  if  Ai 
ia  noseingular.  Search  the  columns  of  So  in  their  natural  order, 
i.e.  from  left  to  right,  and  reordered  in  chains,  as  in  the  usual 
•scheme  II*  search  [5],  Now  observe  that  if  Ai  is  nonsingular, 
then  the  result  of  AI  operating  on  the  above  basis,  is  also  a  basis, 
and  ia  fact  each  of  these  new  basis  vectors  will  be  s  column  ia 
£i,  except  perhaps  for  the  last  new  besisvector.  In  that  case, 
it  may  be  substituted  for  &i  as  new  last  basis  vector.  Note  that 
this  all  corresponds  to  a  scheme  H  search  in  Si,  but  STARTING 
AT  A{&o-  It  follows  that,  unless  there  was  a  full  length  (=  n) 
chain  Aoki,...t(AoAi)*~1Aofti  in  So,  (which  only  happens  if  in 
is  zero),  the  operator  Ai  is  represented  by  /.  The  hi  vector  is  full 
in  general.  The  pair  (Ao,  to)  has  a  canonical  controllability  form 
[5]  representation.  In  the  latter  special  case,  it  follows  that  Ao 
ia  represented  by  a  cyclically  down  shifted  identity  matrix,  and 
Ai  by  a  cyclically  down  shifted  right  companion  (controllability 
canonical  form)  matrix.  The  new  io  (obviously)  remains  zero, 
and  the  new  ii  is  a  full  vector.  As  the  co  and  ci  have  no  particu¬ 
lar  structure,  there  art  4n  free  parameters  in  this  canonical  form. 
By  analogy  to  the  stationary  realization*,  ere  shall  refer  to  this 
form  as  th*  controllability  canonical  form.  Note  that  beesuae  the 
search  extends  over  2n  columns,  the  new  A-matric*s  will  in  gen¬ 
era!  not  bs  in  the  usual  companion  form  themselves,  unless  the 
system  is  uniformly  reachable  in  n  steps,  but  then  this  is  also  the 
case  with  the  time  invariant  multivariable  systems.  In  fact,  it  it 
exactly  because  of  such  a  reduction  from  the  timevarying  to  th* 
multivariable  time-invariant  case  that  all  the  topological  proper¬ 
ties  of  these  systems  ar*  expected  to  carry  over.  In  particular, 
for  multivariable  systems  we  havs: 

Tktortm  6:  Th*  orbit  space  of  the  reachable  eystems  is  an  an¬ 
alytic  manifold,  which  can  be  decomposed  into  generalized  Kro- 
necker  cells  which  form  a  cellular  patch  complex.  Th*  state  epac* 
canonical  forms  act  aa  local  coordinate*. 

Th*  number  of  canonical  form*  that  is  required  to  cover  the 
space  of  all  reachable  alternating  syatema  is  also  equal  to  th* 
number  of  pair*  of  nice  multi-indices  that  can  be  chosen.  Th* 
information  given  her*  b  rather  sketchy,  bnt  th*  details  will  be 
presented  in  a  forthcoming  paper  |6j. 

(.  Oper 'Monad  Transfer  Function 

Because  of  spec*  limitations,  very  little  will  be  said  her*.  Th* 
essential  ingredient  is  the  introduction  of  a  sampling  operator  x, 
with  **  *  x,  and  which  does  not  commute  with  tbs  shift  operator 
s'1,  hi  fact,  w*  have  *-,x*  m  1— x,  from  which  xx+*(l— x)  mg. 
Th*  equation  *»*+. i  *  Ao*»  +  Bo“»  ■  then  transformed  to  s(l  — 
*)•*(*)  »  Ao»X(s)  +  5q*CT(i).  Similarly,  th*  complementary 
equation  transforms  to  **X[g)  «  Aj  (l  -  t)X (*) + Bi  (1  -  x)£7(e). 
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Addin*  yields  the  fonn  X(s)  »  (x/  -  A(*))“‘B(*)Cr(x),  where 
A(»)  *  Ag*  +  Ai(l  -  »)  and  B(»)  =  Bo*  +  Bi(l  -  *).  Defining 
•lao  C(*)  »  Co*  +  Ci(l  -  »},  w*  gtt  tha  Operational  Transfer 
Auction  aa 

*(*,*)«  C(*){*I-A(*))-‘B(*) 

i  ,  m  tranaferm-.trix  with  coefficient*  in  thx  polynomial  ring 
A[*|**  *  *].  Using  the  noncommutativa  relation,  this  formalisni 
ia  very  helpful  in  deriving  all  aorta  of  results  and  transferfunction 
operations  For  instance,  defining  the  odd  and  even  part  of  G(s) 

*[<?(*)! «  C,(s)  « 

(x-*)[c(,)i«  c.(x)  «cWr_g(r*l 

then  the  commutation  relation  implies  for  the  operator! 

C.(x)*  *  *C,(x)  C,(s)»  «  (1  -  *)C,(*) 

and 

C(x)*=*C.(*)  +  (l-*)C#(*) 

For  instance: 

(x/  -  AO'1*  *  [(1  -  *)xJ  +  »Af][x,f  -  Af1]-1 

This  last  rule  allows  to  write  the  OTF  of  the  period-2  system  aa 

C.[x*/  -  AiAo]_,[AlBo*  +  xBj(l  -  *)]  - 
+Ci[x*/  -  AoAlp,[*BQ*  +  AoB4(1  -  *)J 

The  following  duality  is  also  very  helpful  in  reduction. 

B(x,*)  *  Bj(x)*  +  B»(x)(l  —  sr) 

*  *Si(*)  +  (1  -  *)5j{*) 

where 


Finally,  the  reachability  and  observability  conditions  derived  in 
the  previous  section  are  also  readily  obtained  with  this  formalism. 
It  can  also  readily  be  extended  for  use  with  period  AT  systems. 
The  commutation  relation  is  different  Le.  ZH~X*  +  Z*~*wZ  + 

(.  Realisation 

In  this  section  we  show  how  the  above  results  can  lead  to  a 
relatively  simple  realisation  algorithm  for  periodic  systems.  We 
shall  assume  that  the  order  n  of  the  minimal  Af -period  system  is 
known.  There  is  then  an  underlying  uniformly  reachable  system 
£q,...,Ejv-1  of  order  n.  The  realisation  is  given  in  4  steps: 

Step  It  Let  the  pulse  responses  >  l,...,k/r-t;*‘  >  AT 
be  collected.  With  this  data,  the  adiabatic  Haakelmatricae  Ay  , 
y  w  0, . . . ,  AT  -  1  are  formed.  There  are  now  two  possible  routes: 
realise  each  adiabatic  system  separately,  or  realise  the  compoeitte 
system  with  tranefermatrix  [A0(x),...,  Ajv-j(x)]. 

Step  St  Obtain  a  minimal  realisation  (Am.tBa^.Ba^x,..., 
R-  y-i],C— )  at  the  system  with  composite  transfer  matrix  (£g(x), 
...  ,A/r_^(x)j,  where  &<(*}  iethe  transfer  matrix  of  the  sdiahetir 
system  (Ai.Bt,^). 

Then  are  two  ways  in  which  this  can  be  obtained.  The  fort 


ia  to  reduce  the  composite  transfermatrix,  and  obtain  a  minimal 
realization  of  it,  by  standard  multivariabls  techniques  [5].  The 
second  method  consists  in  first  obtaining  minimal  realization. 
(*,A.  ,(?i)  of  the  adiabatic  systems.  These  minimal  realizations 
may  be  of  different  dimensions.  However,  by  the  theorem  I,  there 
exists  e  maximal  characteristic  polynomial,  of  order  g  say,  in  the 
tease  that  the  characteristic  polynomials  of  the  other  realizations 
divide  it,  and  each  non-minima]  adiabatic  realization  can  be  ex¬ 
tended  by  adding  a  non-reachable,  but  observable  subsystem,  so 
that  all  extended  realisations  of  the  adiabatic  susbeyst ems  have 
the  same  order  (=  g),  and  the  same  A  and  C  matrix.  The  com- 
poeite  realization  (A„,  [B„,o, . . . ,  B«/-i],Cm)  is  then  the  de¬ 
sired  form  and  is  minimal  since  (A*,,Ca,)  is  observable,  and  at 
least  ont  at  the  Bm>  forms  together  with  Am  a  reachable  pair. 

Step  3:  Extend  the  realization  at  order  «  obtained  in  step 
2,  to  one  whoee  order  is  a  multiple  at  AT,  by  adding  a  non- 
observable  but  reachable  subsystem.  Indeed,  since  the  mini¬ 
mal  system  (Am,  [Bmp, ....  B«jf-i),C*)  and  tbs  cyclically  aug¬ 
mented  system  (A««,BM,CM)  realize  the  same  transfer  matrix, 
and  since  .be  cyclically  augmented  system  is  uniformly  reach¬ 
able  by  assumption,  the  latter  must  be  an  extension  ( A,,B„C «) 
of  (A.,,  [Ba^o . BmJV-iI.Cm)  by  a  non-observable  but  reach¬ 

able  subsystem  of  order  Nn,  —  f.  This  implies  the  existence  at 
matrices  X,Y,Za,...,Ztr-i  to  that 

[t  £][£•• ::::  £r]lc-  »1 

is  similar  to  the  cyclically  augmented  system.  This  augmentation 
must  not  impair  the  reachability  of  the  realization.  The  necessary 
reachability  of  the  subsystem  implies  that  none  of  the  raws  of 
tbs  matrix  [X,Zo,-..,Z/f-i]  can  be  sero.  This  follows  saaily 
by  contradiction.  If  (AT,  2o, ... ,  Zn- i]  had  a  zero  raw,  then  the 
realization  could  be  partitioned  as 

Am  0  0  BmO .  Bmjt-l 

Xj  Yi  Zo,  Z/t-i 

0  0  Y»  0,  ...,  0 

which  has  the  un-reachable  subsystem  [Fj.O.Cm]- 

Tbs  X,Y,  and  Z,  are  chosen  so  that  the  reachability  matrices 
R(A’;[B,o,A«B,iI),  and  fi(A,j;[Brt,A4B,l])  have  rank  less  than 

n. 

Step  4:  Determine  the  similarity  transformation  that  trans¬ 
forms  the  extended  system  (A>>[B«z,...,B«jr_i],Ct)  to  acyclic 
form.  For  notational  convenience,  we  shall  again  discuss  the  lat¬ 
ter  for  alternating  (i.e.  period-  2)  systems.  Tbs  ideas  for  period- 
Af  systems  art  similar. 

The  reachability  matrix  for  the  cyclically  augmented  matrix 
has  ths  structure 

_  [  Bb»  0,  0,  AoBg,  AoAtBo,  0  1 

**  [  0,  Bi,  A, Bo,  0,  0,  AlA0Bi  J 

whereas  R,  has  entries  ia  all  positions  ia  general.  The  denied 
similarity  maps  R,  into  TJt,  m  Rm-  Partitioning  T  into  [TJ,TJ]*, 
H  is  seen  that  ths  sero  locations  ia  the  above  equation  leads  to 
the  identities: 

T\B(a2;[B«i,A*B«oD  <■  0 

rfB(Aj;(B4o,A.BaD  -  0 

where  the  reachability  matrices  extend  to  n  (block )columnt  only, 
and  an  thus  square  for  single  input  periodic  systems.  On  the 
condition  that  the  test  matrices  Bn  *  A(Aj;  [Bn,  A,Bai]),  and 
Bit  •  B(Aa;[Bn.A,B,l]),  have  raak  leas  than  n,  it  is  possible 
to  Bad  n  En early  independent  rowv acton  ia  the  left  anfispaoes 
of  the  above  reachability  matrices.  Since  the  overall  realisations 


•S3 


are  both  reachable,  thert  ixiiU  Ti  tad  Tj  tueh  that  T  *  JTJ ,  TJ]' 
u  nonsingular  with  T\  and  3j  satisfying  tht  abort  condition*. 

Wt  mmmifiu  then  with: 

Thtartm  7:  An  invertible  transformation  T  can  always  be 
found,  bringing  the  extended  system  2,  of  order  2n  to  the  cycli¬ 
cally  augmented  form  £M(S)  of  the  same  order  if  it  it  reachable 
and  if  the  reachability  matrices  JJ(Aj;  [B,o,  A,B,i])  and 

*(a*; 

\B, i,  AiBao]),  both  have  rank  at  matt  equal  to  n. 

Theorem  f:  Tht  realisation  of  (^0(1),  Bi(x)]  can  always  be 
augmented  so  that  the  extended  system  satisfies  the  conditions 
of  theorem  4. 

Remarks: 

1.  The  minimal  adiabatic  realisations  completely  specify  the 
I/O  behavior  of  the  alternating  system,  and  may  therefor* 
lead  to  new  canonical  representations  for  such  systems. 

2.  Other  identification  methods  for  periodic  systems  exist. 
On*  can  ‘stagger*  the  impulse  responses,  by  looking  at 
every  fV-th  sample,  for  which  the  system  looks  like  a  time 
invariant  one.  However,  the  solution  for  the  individual  real¬ 
isations  in  (£(, . . .  ,£jv)  require  difficult  nonlinear  aquation 
solvers.  Furthermore  since  the  data  samples  with  such  a 
schema  are  not  ‘convoluted”,  a  large  number  of  data  needs 
to  be  collected  (roughly  2nN)  before  the  system  start*  to 
unfold.  The  scheme  presented  here  already  presents  a  lot  of 
information  about  the  system  after  2n  step*,  and  is  there¬ 
fore  more  "holographic* . 

7.  Examples 

Example  X.  Let  A,-,Oy>0  =  e.o.ee.oVo*,...,  and  «  4,e4, 

e4,eo4,o*4 . The  adiabatic  systems  are  realised  by 

If  a  and  4  are  not  both  sero  and  c*  is  different  from  a,  then  a 
minimal  realisation  of  tb*  transfermatrix  (£o(*),  J?i(x)]  in  ob¬ 
servability  canonical  form  [5j  is 

Since  the  order  of  this  realization  is  even,  we  check  first  the  rank 
conditions  on  the  test  matrices 

Hence  choeing  Ti  from  span  |-e,  1]  and  Tj  from  span  (-«,ej  leads 
to  a  transformation  (after  introducing  suitable  parameters} 


which  in  turn  transforms  to  tb*  above  realisation  to 


f  0  -h/»!  ] 

[tl(.-e*)  0  ] 

[  -**,/n  0  J 

[*/!(•  -  **)*»]>  -!/[(» -«*)*,] 


Tb*  period-2  realisation  is  now  read  out  by  inspection.  Suitably 

reparamstrised,  we  find 

{'.WO.  (a/r,4t/r,e/t) 

V  in  tb*  above  example  we  have  e*  m  a,  then  tb*  adiabatic 
realisations  are  (c,c,l)  and  (e,4,l).  This  bads  to  a  reach  abb 


realization  (c,(c,4],l),  provided  that  c  and  4  are  not  both  0,  of 
the  (£o(*),  As  its  order  is  odd,  augmentation  is  required. 

The  augmented  system  is 


The  test  matrices  are 

**°  ~  [  s'  *4 +  y*i]  ’***“[*,  «  +  r*i] 

For  i]  «  0,»i  *  *  and  y  =  —  e  for  instance,  the  rank  of  both  test 
matrices  is  1.  “Wring  then  Ti  *  ft|0, 1],  and  Ti  *  «»[-*,  e],  the 
cyclically  augmented  realisation 

[ J„,„  'T ].[Y 

results,  from  which  tha  (reparametrised)  period-2  system  (r,  t,  r/t), 


(e*/r,4t/r,e/f)  follows. 

Example  2.  Let  *  2,2, 1  +  «,  1  +  «,  1  +  «*,  1  +  o*,  1  +  a*, 

. . .  and  »  1  +  4, 1  +  a*.  1  +  o4, 1  +  •*»,  1  +  o*4, 1  +  - 

The  adiabatic  systems  have  realizations 


!St][  >  ll 

-1  «  1  j  [  1  +  0  J 

S  S  5]f  1[ 

-«  •  1  J  [  l  +  o4  J 


1  0  0  j  and 


1  0  0] 

Extending  with  a  first  order  no-obeervable  state,  the  system 


0 

0 

— • 
*! 


1  0 
0  1 
•  1 
**  *s 


2 

2 

l  +  o 
*1 


1  +  4 
1  +  o4 


[lOOo] 


realizes  the  augmented  adiabatic  system  [ifo(*),  &i(*)]-  It  can  be 
checked  that  the  choice  *j  *  *1  —  *s  =*  0,  y  *  —1,  *\  =  i,*j  * 
— 1  gives  a  system  for  which  the  matrices  R, 0  and  R,i  have  rank 
2  (if  a  differs  from  1).  The  left  nullspace  of  R,i  is  spanned  by 
[o,0,  -l.o  -  1]  and  [0, 1,  -1,0],  while  the  left  nullspace  of  JLo  b 
spanned  by  [l, -1,0,0]  and  [o,0,-l,l  -  a].  The  special  choice 
forT: 

101  o-i 

0-221  _ 

•  0  -1  1  -  a  |  2(o  -  1) 

-2  2  0 

yields  then  an  equivalent  cyclically  augmented  realizaton,  from 
which  the  period-2  system  can  be  identified  by  inspection  as: 
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ABSTRACT 

Periodic  discrete  time  systems  are  analyzed.  In 
particular,  we  investigate  the  Invariants,  Parameteri- 
cacions.  Canonical  Forms,  and  Realization  from  input/ 
output  data  for  such  systems.  It  was  found  that  the 
classical  realization  theory  for  time  invariant  systems 
carries  ovjr  very  nicely  to  such  systems.  A  novel 
definition  for  an  Operational  Transfer  Function  is 
given,  which  is  useful  in  studying  reductions,  reali¬ 
zations,  and  interconnections  of  such  syscems. 


1.  INTRODUCTION 

This  paper  deals  with  periodic  discrete  time  systems  of 
period  N.  To  fix  the  ideas,  a  state  space  realization 
of  such  systems  is  of  the  form 

Vl  “  Ap(k)*k  +  ®p(k)Uk 

yk  ‘  Cp(k)*k  <0) 

p(k)  «  k  mod  N 

The  N-tuple  }  where  I  is  the  triple 

(A[,Bl,CI)  will  refer  to  such  a  realization.  These 
systems  arise,  for  instance,  by  discretization  of 
periodically,  nonuni formly  sampled  continuous  time 
systems,  and  more  general  periodically  switched 
systems.  In  order  to  simplify  the  ideas,  we  shall 
sometimes  look  at  the  special  case  of  alternating 
(i.e.  period  two)  single  input-single  output  discrete 
time  system.  The  main  ideas  for  the  general  case  are 
noc  different,  but  only  more  complex  m  notation. 

While  these  syscems  are  in  many  ways  more  complex  than 
ordinary  time-invariant  systems,  they  hav®  still  much 
more  structure  chan  general  time-varying  discrete  time 
systems  analyzed  by  Kamen  [3],  or  even  the  multi-mode 
systems  described  by  Stanford,  et  al.  (2],  and  Helmke 
11],  and  one  can  develop  a  parameterization  theory  for 
these  systems  which  is  in  close  analogy  to  the  known 
geometric  theory  for  stationary  systems  (Hazewinkel 
[4]). 

In  particular,  the  input-output  behavior  of  such 
systems  is  left  invariant  by  the  transformation  group 
CLn(R)  x  ...  x  GL_(R)  (N  copies),  and  the  orbit  space 
of  the  controllable  systems  is  a  manifold  which  can  be 
decomposed  into  generalized  Kronecker  cells  which  form 
a  cellular  patch  complex.  The  canonical  forms  act  as 
local  coordinate  systems. 

Our  next  main  result  involves  the  realization  of  such  a 
system  from  the  knowledge  of  the  impulse  response 
sequences  (h  ,  ;i>  j ,  j-0,  l} . 

1  »  J 

2.  I/O  EQUIVALENT  TIME-INVARIANT  REPRESENTATIONS 
FOR  PERIOD— N  SYSTEMS 

Some  preliminary  definitions  and  notations  will  be 
given  in  this  section.  Also,  the  observability,  reach¬ 
ability,  and  stability  properties  will  be  discussed. 
The  properties  and  representations  are  the  key  to  the 
realization  given  in  Section  4.  We  shall  discuss  the 
general  case  for  N-periodic  systems  in  this  section. 


Given  the  N-penodic  system  {l  Q,I  j ,  . . .  ,1  }  ,  let  Che 
response  of  the  system  to  a  pulse  occurring  at  instant 
j<N  be  the  sequence  {h  .  ;i>j]  .  The  system  response  is 
readily  seen  to  be  (wheii  [k]  indicates  k  mod  N) 

\.j  '  C(i]Ali-l ]A[i-?]' ’ 'A{ j*l JBI j]  l>j 


Define  che  "Hankel"  matrices  for  this  periodic  system 
as  the  matrices  H.  j  whose  (a  ,b)-element  is  h.  .  _.+j. 
This  matrix  does'3  not  have  the  same  (bloci;  ’llankel 
structure  as  for  time-invariant  systems.  However,  it 
still  allows  a  factorization  in  an  observability  and  a 
reachability  matrix  (as  defined  in  the  time-varying 
case ) . 

H  .  ,  -0  ,itR.  (2) 

j+1  j+1  j 

e.g.  the  a-th  block  entry  in  0-  +  j  and  the  b-th  block  of 
Rj  are,  respectively  J 

[R.]b  *  A[j]A[j-l]---A[j*2-b]B[j*l-b]  (3) 

[0i-l]a  “  C[j+a]A[j*a-l]'  ‘A[j+l]  (4) 

For  fixed  j  in  { 0,  ...,n},  the  derived  sequence 

)vw!‘>0<  is  also  the  response  to  a  unit  pulse  of 
the  'Yol’rowing  augmented  time-invariant  sysiera  of 

order  nN. 

|0  0....  A_ 


°--Vi 


c  -  [c, 

ca  1  1 


with  read-in  matrix  (0, . . ,0,B!  ,0, . . .0] '  where  the 
nonzero  block  B-  occurs  in  the  (j®l)-th  block  position. 
Such  a  time-invariant  representation  of  the  pulse 
response  sequence  (h  . ;  z  > j }  will  be  called  an 
Adiabatic  representation]  The  corresponding  Adiabatic 

Hankel  matrices  H  .  with  (a ,b )-element  h-  •,  will 

have  the  true  Ha'n^tel  structure.  The  Subscript  "ca" 

refers  to  "cyclically  augmented."  The  above  represen¬ 
tation  is,  in  general,  not  minimal.  A  minimal  realiza¬ 
tion  of  the  Adiabatic  Hankel  matrix  H.  will  be  denoted 
AAA  J 

by  (A  .  ,B . ,C . ) . 

J  J  J 

In  order  to  treat  all  h^  j's  at  once,  an  equivalent 
composite  system  (the  Cyclically  Augmented  System)  of 

Nn  states,  lta  inputs,  and  p  outputs,  is  defined  as  che 
realization  (A „,B_.,C„)  where.  A„  and  C  .  are  as  in 

/ .  \  » •_  C  ■  C  A  C  8 

(2),  and  defining  a  Bc4~matrix  as 

Bq  0  ....  0 

B  -  0  Br--‘  0  (6) 

C  A 


Letting  H.(z)  denote  the  Z-transform  of  the  shifted 
sequence  fh.+ .  .  ; k> 0}  ,  then  the  transformer  ion  of  the 
cyclicslly  au^diinted  system  is  simply 

Hcj(z)  -  {H0(z),Hj(z),...,Hn_1(z)}  (7) 

The  H;(z)  are  the  trsnsfer  matrices  of  the  Adiabatic 
systems,  and  it  follows  from  Che  previous  discussion 
that  they  are  realized  in  a  nonminimal  way  by 
(ACa,[0,  ...,0,B'  0,  ...,0r,Cc<),  the  nonzero  element  in 
the  B-matrix  occurring  in  the  (j*l)sc  block  position. 

Remark* . 


1.  Dually,  we  can  also  work  with  an  equivalent 
(nN,m,Np)  system,  thus  treating  the  periodic 
system  as  an  equivalent  stationary  Np-oucput  and 
m-input  system. 

2.  Classical  realization  theory  for  multivariable 
time-invariant  systems  enables  us  to  find  a 
minimal  realization  (F,G,H)  for  the  above  Hankel 
matrix.  This  minimal  realization  is  Chen  Che  key 
to  the  rest  of  our  development.  In  particular, 
since  rhe  equivalent  stationary  system  captures 
all  of  the  input-output  information  of  the 
periodic  one,  so  will  its  minimal  realization 
(F,G,H).  A  parameterization  for  the  periodic 
system  follows  then  directly  from  the  parameter¬ 
ization  cc  the  multivariable  system  (F,G,H).  At 
once  we  see  chat  even  a  scalar  periodic  system 
lead6  to  multivariable  equivalent  systems.  The 
restriction  to  scalar  aystems  mentioned  at  the 
onsec  is  thus  noc  restrictive,  but  permits  simpler 
notation  and  examples. 


We  indicate  some  particular  results  which  will  be 
useful  in  th>  realization  problem: 


Theorem  1.  The  minimal  realizations  (Aj.B^C^)  of  the 

Adiabatic  Hanxel  matrices  H  have  the  property  chat 

—  N  ^ 

det (zI-A^ )  divides  (z  I-AqAj  .  .  .  A^j)  . 


Proof  (for  N*2).  The  Hankel  macrix  HQ  is  obtained 
the  pulse  response  { h ^  q} .  Its  Z-transform  equals 


Ho(z) 


C,  z  I 


"Vo) 


-1 


Vo 


+  c 


it 


z2I 


-A0Al)~ltBC 


■  [Nq(z2 )*zNj (z2 ) ] /det ( z2 I-AqAj) 


from 


for  some  polynomial  matrices  Nq  and  Nj.  Clearly  then 
the  minimal  realizations  of  and  Hj  have  the  above 
stated  property.  • 


Hence,  if  we  define 

che  following  trsnsfer 

matrix : 

!  2  1-1 

- 

|z  I-A.A-  0  | 

A ,  zl 

H01  *  lco-cx] 

*  0  I 

!  o  *2i-a0Aj 

o 

< 

*4 

then  H0-”01[B0'0]' 

and  Hj-H01 [O.Bj]  '  . 

This  observation  leads  directly  to  the  following 
theorem: 

Theorem  2.  There  exists  an  observable  pair  (Aj_C)_  arnd 
matrices  B.  and  B.  such  thst  (A,Bq,C"),  and  (A,Bj,C) 
realize,  respectively,  the  Adiabatic  transfer  matrices 


Proof.  Let  (A,B,C)  be  a  minimal  (observable  is  suffi¬ 
cient)  realization  for  Hn. ,  then  B  *B [B ’ ,0 ] ’ ,  and 
¥j*B [0, B  J ] 1 .  01  °  • 

The  importance  of  this  theorem  lies  in  its  use  to  find 

the  realizations  for  an  alternating  system  [6].  Given 

the  pulse  response  sequences  {h  .]  and  fh  j}  ,  we  can 

use  the  realizations  of  either'  sequence.'  By  the 
theorem,  these  realizations  can  be  extended  by  addition 
of  uncontrollable  states  if  necessary,  to  observable 
realizations  with  the  same  A  and  C  matrix. 

3.  REACHABILITY,  OBSERVABILITY,  AND  STABILITY 

Definitions: 

•  The  N-per iodic  system  {ig.I  j  * •  •  •  »ru_ [1  l*  *ald  t0 

be  uniformly  p-reachable  (reachable  m  p  steps), 
iff  every  state  can  be  reached  in  p  steps, 

independently  of  the  starting  event  (•  initial 

time  and  initial  state).  The  system  is  said  to  be 
uniformly  reachable,  iff  there  exists  a  p>0,  such 
that  it  is  uniformly  p-reachable. 

•  The  system  is  said  to  be  uniformly  observable 

in  p  steps  iff  the  initial  state  x.  can  be 
uniquely  determined  from  p  consecutive  outputs 

{y . . .+  },  independent  ly  of  the  starting 

C  isie  j.  J  flie  system  is  said  to  be  uniformly 

observable  iff  it  is  p-observable  tor  some  p. 

Theorem  3.  The  period-N  system  j)  is 

uniformly  reachable  iff  the  reachability  matrices  (3) 
have  full  rank  for  all  j.  The  system  is  uniformly 
observable  iff  the  observability  matrices  (4)  have  full 
rank  for  all  j. 


In  fact,  it  is  easy  to  show  that  the  realizations 
of  Hq  and  Hj  maut  be  very  closely  related.  Indeed,  by 
writing  Hj  in  the  form 


Hj  -  [Cj.C0] 


z  I-AjjAj 
0 


r  ao 


:  b, 


z  1  ”A  l A  0 


zli 


-  {c0.Cj : 


z  1-AjAq 


z  I-AqAj 


-1 


z  I  ■ 


'1 


The  first  factor*  on  the  left  also  appear  in  the 
expansion  of  H 


Hq  «  [C0.C,] 


z  I-AjA0 


z  I- 


Vi 


-1 


"1 

zl 


The  proof  is  easily  established  by  a  standard  argument 
[5).  Since  the  Adiabatic  systems  of,  at  most,  order  nN 
provide  an  underlying  time-invariant  structure  in  the 
problem,  at  most  nN  steps,  need  to  be  considered  for 
checking  uniform  reachability  and  observabil ity ,  by 
virtue  of  the  Cayley-Hamil ton  Theorem.  Some  direct 
corollaries  of  the  theorem  are: 

(1)  The  Cyclicially  Augmented  System 

(Ac, ,  BCa ,  Cca)  i»  reachable  iff  the  penod-N 
realization  jl.,I  )  is  uniformly 

reachable . 

(2)  {l  ,1  }  is  uniformly  observable 

(reachable)  iff  {l  .,1 .  ,  . . .  ,1  j)  is  uniformly 
observable  (reachable*,  whence  the  invariance 
of  uniform  observability  and  reachability 
under  a  cyclic  shift. 


CANONICAL  POEM,  PARAMETER IZATION 
AMS  TOPOLOGICAL  STRUCTURE 


(3)  Using  the  backward  propagation,  we  can  write 
the  output  at  time  i  in  terms  of  the  previous 
inputs,  i.e,  we  look  at  {h.  .}  for  fixed  i, 
and  define  the  equiva\fe4it  stationary 

systems  with  the  above  A  matrix  and 
C*[0, . . . , 0, ,0, . . . , 0] ,  the  nonzero  block 
occurring  in  the  i-th  block  position, 

and  B*B(Bi,B| , . . . ,B„  We  then  have  the 

duality  property: 

(l  j,  . . .  ,1  }  is  uniformly  observable 

iff 


4. 


The  first  object  in  this  study  is  to  find  the  transfor¬ 
mations  on  the  realizations  that  leave  the  input-output 
behavior  (i.e.  all  Adiabatic  transfer  matrices  and  the 
periodic  system  "Hankel"  matrices  (2))  invariant. 

Let  {l  ,1  ,  .  .  .  be  a  realization  of  an  N-penod 

system.  Denote  an  element  of  the  group  Cl  (R)*1’, 

N  ” 

denoted  by  G1  for  short,  by  ,).  The 

n  .  ,  ,  ,  w  i  N" i 

group  action  it  defined  by 


d 

N-l  ’• 


is  uniformly  reachable, 


(P 


O’ ' 


.pN_l>  :  ^(Ao,B0*C0) . (Vl,BN-rCN-l^  > 


where  the  "dual"  system  is  .  btained  by  £ime 
reversal  of  the  sequence  of  the  duals  1^  of 
the  realizations  I.,  where  (/-,B  ,C  )“  is*the 
triple  (A',c!,B').1  We  are  thus  led  to  the 
defmitioA:  1  1 


(9) 


^PlV°0  ,P1B0,C0P0  P0%-1  PN-1  ’  P0BN-1  ’  CN-1 PN-  1'  ' 

The  states  transform  as 


{Ej ,r2 , . . .,eq} 


d  ual 


r  d  d  d 

^Z0,ZN-1’"  ,,Z1 


(8) 


Finally,  we  remark  that  if  all  At  are  nonsingular,  as 
for  instance  in  the  important  case  of  the  discretiza¬ 
tion  of  a  continuous  system,  the  criterion  of  Theorem  1 
can  be  simplified  by  virtue  of  the  following: 


Leosa:  If  the  Aj  are  nonsingular  for  all  j,  then  the 

full  rankness  of  one  of  the  reachability  matrices  Rj 
(observability  macnces  Ch)  implies  the  full  rankness 
of  all  others,  and  hence  reachability  (observability). 


As  an  example,  a  siso  alternating  system  {Iq.E.}  will 
be  uniformly  reachable  iff  the  stationary  systems 
(a  j ,  A  (b  j , A .  ,bp ] )  and  ( Aq, A^ , [ b^ ,Aq ,b ^ ] )  are 
reachable.  If  the  system  is  uniformly  reachable,  no 
more  than  2n  steps  are  required  to  reach  any  desired 
endstate.  If  the  product  AqA j  is  nonsingular,  then 
(A  Ag.tbj.A  ,b0l)  and  ( A  , Aj , ( bfl , A  ,b  ) )  are  either 
both  reachable  or  both  no- . eschable .  By  applying 
inputs  before  0,  one  gets  the  reachability  relation  at 
time  0: 


where  Rj“[b j , A, b^, AjAgb j , . . . ]  is  the  time-varying 
reachability  matrix  [3).  Observation  of  the  output 
sequence  after  time  0,  with  no  input  applied,  leads 
then  to  the  observability  relation: 


[y0.yj.y2...J’  -  oQx0 

where  0»( c’A^cj .AqAJCq, '  is  the  time-varying 
observability  matrix.  Similarly,  we  construct  the 
reachability  and  observability  matrices,  Rq  and  0, , 
relating  to  the  reference  time  1.  The  products  OqRi 
and  OjRp  are  then  the  alternating  (period-2)  Hankel 
matrices  defined  in  (2). 


We  also  have  the  following  important  stability  theorem: 

Theorem  4.  The  N-period  system  (0)  is  stable  if  the 
eigenvalues  of  the  product  AQAj...Ajj_j  have  modulus 
less  than  1. 


Proof.  The  convergence  properties  of  the  periodic 
systems  are  determined  by  the  convergence  properties  of 
the  equivalent  t ime- invariant  system  ( Aca , 8CI ,Cc a) . 
The  latter  is  completely  determined  ty  the  character¬ 
istic  polynomial  det  (  z^I-AjAjAj  . .  .  A^j-O.  • 

The  problem  with  this  approach  is  that  the  resulting 
time-invariant  system  has  order  Nn  if  n  is  the  order  of 
the  individual  realizations  £..  The  original  periodic 
system  is  only  of  n-rh  order,  so  that  a  "hidden  modes" 
phe  nome non  occurs. 


*Nk 

*Nk»l 


— >  P0XNk 
— >  PiXNk*i 


1  *  1 . N-l 


(10) 


The  following  property  is  readily  shown: 

Theorem  5.  Equivalence  of  State  Space  Representations. 
The  product  group  G1  tR)^  action  on  the  set  of  period-N 
systems  leaves  the  I/O  properties  invariant. 

Once  the  symmetry  group  (i.e.  the  group  whose  action 
leaves  the  I/O  behavior  invariant)  is  established,  we 
can  look  at  the  question  of  canonical  forms:  Any 

property  of  the  original  system  can  be  described  as  a 
map  from  the  set  of  systems  I,  to  some  suitable  set  S. 
After  introducing  canonical  forms,  the  study  of  the 
original  function  f  is  then  replaced  by  the  study  of 

some  "simpler"  function  f:C  >  S,  such  that  f»f  o  it  , 
where  it  is  the  canonical  projection  *  :Z  — >  C  on  the 
set  of  canonical  forms. 


For  notational  simplicity,  the  rest  of  this  section 
will  be  restricted  to  alternsting  systems.  The  above 
development  (N«2)  should  give  enough  insight  to  realize 
that  the  general  principles  remain  the  same.  Canonical 
forms  for  the  uniformly  reachable  systems  are  obtained 
by  the  usual  Kronecker  selection  procedure,  i.e.  among 
the  2n  col  urn:  s  of  Rq,  selecc  n  linear  independent  ones, 
which  form  a  basis  { 6 _}  for  the  state  space.  In 
particular,  a  unique  "nice"  selection  may  be  chosen 
according  to  the  Young  or  "crate"  diagram.  Similarly, 
let  {8,}  be  another  basis,  chosen  by  a  nice  selection 
among  the  columns  of  Rj.  Now  express  the  system  with 
respect  to  the  basis  which  is  alternating  becween  •{  B Q } 
and  { B  ^}  ,  e.g.  Aq  is  represented  by  the  new  matrix 
whose  i-th  column  is  the  representation  in  terms  of  the 
basis  { B  j }  of  Aq  operating  on  the  i-th  basis  vector 
from  the  other  basis  { B ^ . 

The  effect  is  that  the  new  representation  is  of  the 
form  bQ«bj"( 1, 0, . . . ,0] ' .  (By  assumption  of  reach¬ 
ability,  neither  bQ  nor  b,  are  zero.)  If  vt(k)  denotes 
the  position  of  the  k-th  basis  vector  from  Rj ,  then  we 
refer  to  the  sequences  w  »{v  (l),...,v  ( n )}  as  a  multi¬ 
index.  The  k-th  column  of  the  new  A  matrices  are 

Are  -  Rre 
0*k  *0  vl(k)*l 

.r  r 

l*k  Kl'vO(k)*l 

However  straightforward  the  previous  extension  of  the 
known  scheme  may  be,  a  particular  nice  form  is  obtained 
as  follows  if  Aj  is  nonsingular.  Search  the  colioans  of 
R0  in  their  natural  order,  i.e.  from  left  to  right,  and 


reordered  in  chains,  as  in  the  usual  "scheme  II"  search 
[5],  Now  observe  Chat  if  A,  is  nonsingular,  then  the 
result  of  /j  operating  on  the  above  basis  is  also  a 
basis,  and  in  fact,  each  of  these  new  basis  vectors 
will  be  a  column  in  Rj,  except  perhaps  for  the  last  new 
basis  vector.  In  that  case,  it  may  be  substituted  for 
b|  as  new  last  basis  vector.  Note  that  this  all  corre¬ 
sponds  to  a  scheme  II  search  in  Rj,  but  STARTING  AT 
Ajbn.  It  follows  that,  unl^sj  there  was  a  full  length 
(“nl  chain  j AQb j , . . . , (AqA, )  1 AQb ^ }  in  R0  (which  only 

happens  if  bg  is  aero),  the  operator  Aj  is  represented 
by  I.  The  bj  vector  is  full  in  general.  The  pair 
(Ag,bg)  has  a  canonical  controllability  form  [5] 
representation.  In  the  latter  special  case,  it  follows 
that  Ag  is  represented  by  a  cyclically  down  shifted 
identity  matrix,  and  Aj  by  a  cyclically  down  shifted 
right  companion  (controllability  canonical  form) 
matrix.  The  new  bg  (obviously)  remains  rero,  and  the 
new  bj  is  a  full  vector.  As  the  Cg  and  Cj  have  no 
particular  structure,  there  are  4n  free  parameters  in 
this  canonical  form.  By  analogy  to  the  stationary 
realizations,  we  shall  refer  to  this  form  as  the 
controllability  canonical  form. 

Noce  chat  because  the  search  extends  over  2n  columns, 
the  new  A  matrices  will,  in  general,  not  be  in  the 
usual  companion  form  themselves,  unless  the  system  is 
uniformly  reachable  in  n  steps,  but  then  this  is  also 
the  case  with  the  time- invariant  multivariable  systems. 
In  fact,  it  is  exactly  because  of  such  a  reduction  from 
the  time-varying  to  the  multivariable  time-invariant 
case  that  all  the  copological  properties  of  these 
systems  are  expected  to  carry  over.  In  particular,  for 
multivariable  systems,  we  have: 

Theorem  6.  The  orbit  space  of  the  reachable  systems  is 
an  analytic  manifold,  which  can  be  decomposed  into 
generalized  Kronecker  cells  which  form  a  cellular  patch 
complex.  The  state  space  canonical  forms  act  as  local 
coordinates . 

The  number  of  canonical  forms  that  is  required  to  cover 
che  space  of  all  reachable  alternating  systems  is  also 
equal  to  the  number  of  pairs  of  nice  multi-indices  that 
can  be  chosen. 

5.  OPERATIONAL  TRANSFER  FACTION 

The  essential  ingredient  is  the  introduction  of  a 
sampling  operator,  »^,  taking  sequences  into  sequences, 
defined  via 


— >  {yx  ;  i>°} 


yNk*i  ’  °  for  1  e  (1. ...,N-lj 

2 

Clearly,  t  that  1,N  18  *  projection  operator. 

Letting  luz)  denote  the  usual  Z-transform  of  the 
sequence  u(z),  then  induces  an  operator  in  the 
Z-domain  which  we  shall  denote,  with  a  slight  abuse,  by 
the  same  notation  *  .  Note  that  then 

N 

*Mz  ^  1  •  0  for  i  c  { 1 , .  .  , , N— 1 } 


The  space  of  formal  power  series  in  z”^  can  then  be 
decomposed  into  N  orthogonal  subspaces,  each  of  which 
induces  m  turn  another  projection  operator.  The  set 
of  subspaces  and  the  set  of  projection  operators  are 

isomorphic  structures.  Thus,  define  by  z  *»„zl 

for  i  t  {l,...,N-tJ.  It  follows  at  once  that  these 


though,  the  operator  algebra  will  be  a  noncommut  at  ive 
one.  The  union  of  all  these  subspaces  is  the  whole 
apace,  so  that  we  have  the  relator 


z  V  +  1  V  4 


^  -N*  1  N-l 

♦  z  *  z 
n 


or,  equivalently 

N-l  N-l  N-2  N-2  N-l 

Z  *  Z  8n  ♦  Z  *HZ  ♦  ...  ♦  ZTjjZ  ♦  HjjZ 

Thus  the  above  can  be  formalized  as  follows:  Period-N 
systems  of  order  n  can  be  represented  by  an  n-ch  order 
realization  (a(»jj)  ,B(»  )  ,C(e  ,  whose  coefficients  are 
m  che  multivariable  polynomial  quotient  ring 

Rf4l)  ;  ^f1 . •)  • 

N  rN  N  ij' 

The  periodic  state  space  realization  equations  (0)  are 
then  transformed  to 


•  Vi*’- 


for  1  *  N-l 


(0)  ,  (N-l)  „  (N-l) 

*N  X  ’  Vi’S  X  *  BN -1%  U 
( i)  _  „  (i) 

*N  y  “Vn  X 

Upon  Z-transforming,  we  find 

*s^*4^X(z)  *  A.x^'^X(z)  +  B  itf,l^U(z)  for  i  *  N-l 
N  l  N  l  N 

ze^0)X(z)  -  zNx0  -  An_jxJN"1)X(z)  ♦  BN.1x1JN'1)U(z) 

*£1>Y(z)  -  Ci»^l)X(z) 

Note  the  appearance  of  che  initial  condition  (xg)  term. 
Adding  the  left  hand  sides,  taking  account  of  the  above 
relation  between  the  projection  operator,  yields 

v,  .  N  ,  (1)  (N) \  ,  . 

zX(z)  -  z  xQ  •  a(xn  . )X(z) 

♦  B(*£n,...,i^N))UU) 

Y(z)  •  C(«|^1),...,*^N))X(z) 

Hence,  we  get,  assuming  zero  initial  conditions,  and 

substituting  by  che  combination  z  the 

Operational  Transfer  Function  (OTF) 

H(z,»n)  -  C(z,xN)[zI-A(z,*N)]’'1B(t,itN)] 

where  now  very  simply: 


A(,*V  "  Vn  *  Al*  V  4 


B(z,xN)  -  B0xn  ♦  BjZ.jW^  ♦ 


N-i'-n+i’Vn-i 


.  -N+l  N-l 

bn-i*  V 


operators  are  all  generated  by  z  and  *  ,  clearly 

N 


C{t-V  *  Vn  4  ciMV  4  —  W-nmVn-i 

In  order  <o  illustrate  che  ideas  for  period-2  systems 
(N“2),  we  have:  t»+z(l-e  )“x.  The  equation  *2R+l“ 

A0x2k+B0uk  11  t,,en  transformed  to  t(l-m  )X(z)*Aq1> X(z)* 
Bj*U(z).  Similarly,  the  complementary  equation 
transforms  to  nX(»)“Aj ( W  )X(z)*B j(  1-*  )U(z) . 

Addition  yields  the  form  X(z)“( zI-A(* ))  *B(»)U(z), 

where  A(»  )“Agt *A.  (1-x ) .  Using  the  noncommutat  ive 
relation,  this  Formalism  is  very  helpful  in  deriving 
all  sorts  of  results  and  transfer  function  operations. 


For  penod-2  systems,  some  of  Che  ideas  on  Operational 
Tranafer  Matrix  Reductions  were  explored.  For 
instance,  connections  (parallel,  aeries,  and  feedback) 
can  be  performed  with  the  same  formal  rules  as  for 
stationary  systems,  as  long  as  the  nonconmutadvity  is 
taken  into  account  during  the  reduction. 

We  shell  here  also  explore  the  possibility  of  connect¬ 
ing  systems  of  DIFFERENT  periodicity.  So  one  system 
may  have  the  OTF  and  another  G(z,«  ).  The 
series  connection  is  then  simply  G(z (z ,»N) . 
Clearly,  the  combination  involves  now  three  generators: 
z,  and  *  .  So  one  needs  to  define  the  composite 
transfer  matrix  as  a  rational  division  ring  (Noncoremu- 
tative  Field)  extension  of  the  polynomial  ring  with 
three  generators  (z ,*  ,s  ).  Clearly  additional  commu¬ 
tation  rules  (relators  in  the  division  ring)  need  to  be 
invoked.  This  work  is,  as  of  this  writing,  in  progress 
and  will  be  reported  in  the  final  version  of  this 
paper. 

Finally,  we  report  chat  some  interesting  realization 
related  properties  can  be  developed  from  within  the  OTF 
framework  as  well.  In  particular  (for  N*2): 


5.2  The  Observability  Problem  via  the  OTF 

Here,  the  inputs  are  zero,  and  a  nonzero  initial  condi¬ 
tion  Xq  is  assumed.  The  system  output  is  given  in  the 
transform  domain  by 

Y(z)  -  C(«)[zI-A(it)]-1x0 

■  C(*  )[  I-z  ^(w)]  1  z  ]xQ 

■  C(*)[l+z  lA(x)*[z  1  (An  ))2». .  .]  z  'xQ 

•  C(it)z  !xq  ♦  C(*)A(l-n)z  2xq  ♦  ... 

-  O(x)X0(z) 

where  0(x  )“0q”  +0  j  ( 1  — ir ) .  As  for  the  reachability 
problem,  the  matrix  0(it)  is  said  to  have  full  rank  if 
both  Oq  and  0,  have  full  rank.  The  condition  for 
observability  follows  them  as  from  the  full  rankness 
of  0(»). 


5.1  teachability  Problem  via  the  OTF 


6.  REALIZATION 


With  zero  initial  conditions,  and  input  sequence  { } , 
the  Z-transform  of  the  state  sequence  is  given  by 

X(z)  «  [zI-A(it)]_1B(x)U(z) 

-ll-l 


*~l[ 

-lr 


z  [l-A(*)z  *J  *B(x)U(z) 


[l+A(«)z  1  +  ...*(a(»)z  <)KB(*)z”K*...]u(z) 

Noting  that  the  coosnutation  of  *  and  z“*  involves  an 
involution,  i.e. 

Z  JA(s )  «  A(l-x  )z  1  , 
we  get  the  series  expansion 

zX(z)  -  [B(x)+A(x)B(1-w)i-1*a(x)A(1-it)B(x)*...]u(z) 

"  [B(*),A(*  )B(l-»  ),A(x  )A(l-x  )B(x) ,  . ..]  rU(z) 

z_1U(z) 

-2, 


-Ilk. 


U(z) 


*  R(»)U(z) 


The  operator  reachability  matrix  decomposes  into  two 
parts : 

R<*  )  -  xRq  ♦  (1— s  )Rj 

where,  in  terms  of  the  system  components: 

• 

R0  "  [Bl,AlB0,AlA'BrAlA0AlB0,,,’J 

Ri  ’  |,o,Ao,rViVViVr‘”1 

As  the  operators  »  and  l-»  select  complimentary  parts 
of  the  vector  t Hz),  we  find  for  the  condition  of  reach¬ 
ability  that  both  Rq  and  Rj  should  have  full  rank.  We 
ahall  say  that  then  the  operational  reachability  matrix 
R(w)  has  full  rank,  so  that  the  usual  criterion  for 
reachability  is  retrieved. 

For  instance,  upon  identifying  the  coefficients  of  z-^ , 
we  obtain 

\  *  WlVo  *  WlVl  *  A1B0u2  *  B1u3 


In  this  section  we  show  how  the  above  results  can  lead 
to  a  relatively  simple  realization  algorithm  for 
periodic  systems.  We  shall  assume  that  the  order  n  of 
the  minimal  N-penod  system  is  known.  There  is  then  an 


underlying  uniformly  reachable  aystem  {l 
order  n.  The  realization  is  given  is  4  steps: 


O'  ’ 


of 


Step  1.  Let  the  pulse  responses 
{h.  M  .  ;i>N‘  be  collected, 
the  *  Adiab: 


Step  3. 


'mV"  M| 


this 


Step  2. 


data, 

iabatic  Henkel  matrices  Hj ,  j*0,..,,N-l 
are  formed.  There  are  now  two  possible 

routes:  realize  each  Adiabatic  system 

separately,  or  realize  the  composite  system 
with  transfer  matrix  [  HQ(z) , . . . ^ ( z ) ]  . 

Obtain  a  minimal  realization  (a  , [ B^  , 

.  .  .  ,B  ,],C  )  of  the  system  wi\:h  'ebrapo'site 
transfir  matrix  [ Hq(z) , . . . ,H„  , ( z)] ,  where 
Hj(z)  is  the  transfer  matrix  of  the  Adiabatic 
system  (A^B^C^). 

There  are  two  ways  in  which  this  can  be 
obtained.  The  first  is  to  reduce  the  compo¬ 
site  transfer  matrix,  and  obtain  a  minimal 
realization  of  it,  by  standard  multivariable 
techniques  15).  The  second  method  consists 
in  first  obtaining  minimal  realizations 
( Ai , ®i , Ci )  of  the  Adiabatic  systems.  These 
minimal  realizations  may  be  of  different 
dimensions.  However,  by  Theorem  1,  there 
exists  a  maximal  characteristic  polynomial,  of 
order  q  say,  in  the  sense  that  the  character¬ 
istic  polynomials  of  the  other  realizations 
divide  it,  and  each  nonmimmal  Adiabatic 
realization  can  be  extended  by  adding  a 
nonreachab le ,  but  observable  subsystem  so  that 
all  extended  realizations  of  the  Adiabatic 
subsystems  have  the  same  order  (*q),  and  the 
same  A  and  C  matrix.  The  composite  realiza- 

tion  (Aa,[Bm  0,....B  N  , J,cj  th«"  the 

desired  fornf  and  is  minimal  since  (A_,C )  is 


observable,  and  at  least  one  of  the  Bn  forms 
together  with  *m  •  reachable  pair.  ’ 

Extend  the  realization  of  order  q,  obtained  in 
Step  2,  to  one  whose  order  is  a  multiple  of  N 
by  adding  a  nonobservable  but  reachable 
subsystem.  Indeed,  since  the  minimal  system 

(Aml[Bm,0*----B»tN-l1*CJ  ,nd  th* 
augmented  systeA  tA  B  t  )  r«,iite  the 


same  transfer  utru,  and  since  the  cyclically 
augmented  system  it  uniformly  reachable  by 
aaaumption,  the  latter  mutt  be  an  augmentation 
(A^.B^.C^)  of  (An#lB  * 
nonobtervable  but  reacnable  subsystem  oT  order 
Nn-q.  Thia  implies  the  existence  of  matrices 
X,Y,Z0, ...,ZN_2  so  that 


'Bm0’ *  * "* Bm,N-l 
2fi*  '  ‘  *,ZM-1 


K  o] 


is  similar  to  the  cyclically  augmented  system. 
This  augmentation  must  not  impair  the  reach¬ 
ability  of  the  realization.  The  necessary 
reachability  of  the  subsystem  implies  that 
none  of  the  rows  of  the  matrix  [X, Zq, . . . , ZN  .  1 
can  be  zero.  This  follows  easily  by 
contradiction.  If  [X,Zq, . . . ,Z„  j ]  had  a  zero 
row,  then  the  realization  coulo  be  partitioned 


have  rank  less  than  n,  it  is  possible  to  find 
n  linearly  independent  row  vectors  in  the  left 
nullspacea  of  the  above  reachability  matrices. 
Since  the  overall  realizations  are  both 
reachable,  there  exists  Tj  and  Tj  such  that 
T*  iT'.Tjj  '  is  nonsingular  with  Tj  and  Tj 
satisfying  the  above  conditions. 

We  susnarize  then  with 

Theorem  7.  An  invertible  transformation  T  can  always 
be  found,  bringing  the  extended  system  (F,g)  of  order 
2n  to  the  cyclically  augmented  form  I  (5)  of  the  same 
order  if  it  is  reachable  and  if  C?he  reachability 
matrices  F( F2  ,  Ig_,Fg  ])  and  F(  F?  ,  [  gj  ,Fg  ])  both  have 
rank  at  moat  equal  to*  n. 


Theorem  8.  The  realizat 
be  augmented  so  that  th< 
conditions  of  Theorem  4. 


lization  of  [H  (z),H  (z)]  can  always 
it  the  extended  system  satisfies  the 


Bm0’  "‘'Vn-I 


iXl  X2  Y1  ,20*  *  * -*ZN-1 

|0  0  v2  j  j0,...,0 


which  has  the  unreachable  subsystem  [YjiO.C^l. 

The  X,Y,  and  Z  are  chosen  so  that  the 
reachability  matrices 

R(Ae;lBeO,AeBel*4'*,Ae  Be,N-l^* 

R(Ae^Bel-AeBe2 . . 

Ae  •  t  ®e ,N-1  ’ AeBeO ’  ’ '  ‘  *  Ae  ®e,N-2^ 
all  have  rank  less  than  n. 

Determine  the  similarity  transformation  that 
transforms  the  extended  system  (a  , [ B  ,  .... 
B_  to  a  cyclic  form.  For*  notat  ional 

convenience,  we  shall  again  discuss  the  latter 
for  alternating  (i.e.  period-2)  systems.  The 
ideas  for  penod-N  systems  are  similar. 

The  reachability  matrix  for  the  cyclically 
augmented  matrix  has  the  atructure 


®0*  °’  0>  A0B1*  A0A1B0’  0 

o,  Bj  ,  AjBq,  0,  0, 


AiAoV 


whereas  Re  has  entries  in  all  positions  in 
general.  The  desired  similarity  maps  R#  into 
TK(*RC|.  Partitioning  T  into  [Tl.Tl]',  it  is 
seen  chat  the  zero  locations  in  the  above 
equation  leads  to  the  identities: 

Tl*(A.2;[»el •**»*<>])  *  0 

T2R^A«2:[B,0*A,Bel^  “° 

where  the  reachability  matrices  extend  to  n 
(block)  colunns  only,  and  are  thus  square  for 
sing':  input  periodic  systems.  On  the  condi¬ 
tion  that  the  test  matrices 


and 

R.rR(A.2:<»,l-AeB,0»i 


Some  simple  illustrative  examples  are  given  in  [6]. 

Remarks . 

1.  The  minimal  Adiabatic  realizations  completely 
specify  the  I/O  behavior  of  the  alternating 
system,  and  may  therefore  lead  to  new  canonical 
representations  for  such  systems. 

2.  Other  identification  methods  for  periodic  systems 
exist.  One  can  "stagger"  the  impulse  responses, 
by  looking  at  every  N-th  sample  for  which  the 
system  looks  like  a  time-invariant  one.  However, 
the  solution  for  the  individual  realizations  in 
(£.,...,!„)  require  difficult  nonlinear  equation 
solvers.  Furthermore,  since  the  dats  samples  with 
such  a  scheme  are  not  "convol uted , "  a  large  number 
of  data  needs  to  be  collected  (roughly  2nN)  before 
the  system  starts  to  unfold.  The  scheme  presented 
here  already  presents  a  lot  of  information  about 
the  system  after  2n  steps,  and  is  therefore  more 
"holographic ." 
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